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HUMAN LIPID-ASSOCIATED PROTEINS 



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human lipid-associated 
5 proteins and to the use of these sequences in the diagnosis, treatment, and prevention of cardiovascular, 
neurological, and gastrointestinal disorders, and disorders of lipid metabolism. 



BACKGROUND OF THE INVENTION 

Lipids are water-insoluble, oily, or greasy substances that are soluble in nonpolar solvents such 

10 as chloroform or ether. Neutral lipids (triacylglycerols) serve as major fuels and energy stores. Polar 
lipids, such as phospholipids, sphingolipids, glycolipids, and cholesterol, are key structural components 
of cell membranes. Lipids and proteins are associated in a variety of ways. Glycolipids form vesicles 
that carry proteins within cells and cell membranes. Interactions between lipids and proteins function 
in targeting proteins and glycolipids involved in a variety of processes, such as cell signaling and cell 

15 proliferation, to specific membrane and intracellular locations. Proteins are associated with the 

biosynthesis, transport, and uptake of lipids. In addition, key proteins involved in signal transduction 
and protein targeting have lipid-derived groups added to them post-translationally (Stryer, L. (1995) 
Biochemistry , W.H. Freeman and Co., New York NY, pp. 264-267, 934). 

A major class of phospholipids are the phosphoglycerides, which are composed of a glycerol 

20 backbone, two fatty acid chains, and a phosphorylated alcohol. Principal phosphoglycerides are 
phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, phosphatidylinositol, and 
diphosphatidylglycerol. Many enzymes involved in phosphoglyceride synthesis are associated with 
membranes (Meyers, R.A. (1995) Molecular Biology and Biotechnology . VCH Publishers Inc., New 
York NY, pp. 494-501 ; Stryer, supra , pp. 264-267). The enzyme phosphatidylserine decarboxylase 

25 catalyzes the conversion of phosphatidylserine to phosphatidylethanolamine, using a pyruvate cofactor. 
The two forms of yeast phosphatidylserine decarboxylase are localized to the inner mitochondrial 
membrane and to the Golgi/vacuole membrane, respectively. The mammalian enzyme, also localized 
to the inner mitochondrial membrane, is made as a proenzyme and subsequently cleaved to alpha and 
beta subunits (Voelker, D.R. (1997) Biochim. Biophys. Acta 1348:236-244). 

30 Cholesterol, composed of four fused hydrocarbon rings with an alcohol at one end, moderates 

the fluidity of membranes in which it is incorporated. In addition, cholesterol is used in the synthesis 
of such hormones as Cortisol, progesterone, estrogen, and testosterone. Bile salts derived from 
cholesterol facilitate the digestion of lipids. Cholesterol in the skin forms a barrier that prevents excess 
water evaporation from the body. Farnesyl and geranylgeranyl groups, which are derived from 

35 cholesterol biosynthesis intermediates, are post-translationally added to signal transduction proteins 
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such as ras and protein-targeting proteins such as rab. These modifications are important for the 
activities of these proteins (Guyton, A.C. Textbook of Medical Physiology (1991) W.B. Saunders 
Company, Philadelphia PA, pp.760-763; Stryer, supra , pp. 279-280, 691-702, 934). 

Mammals obtain cholesterol derived from both de novo biosynthesis and the diet. The liver is 

5 the major site of cholesterol biosynthesis in mammals. Biosynthesis is accomplished via a series of 
enzymatic steps known as the mevalonate pathway. The rate-limiting step is the conversion of 
hydroxymethylglutaryl-Coenzyme A (HMG-CoA) to mevalonate by HMG-CoA reductase. The drug 
lovastatin, a potent inhibitor of HMG-CoA reductase, is given to patients to reduce their serum 
cholesterol levels. Cholesterol derived from de novo biosynthesis or from the diet is transported in the 

10 body fluids in the form of lipoprotein particles. These particles also transport triacylglycerols. The 
particles consist of a core of hydrophobic lipids surrounded by a shell of polar lipids and 
apolipoproteins. The protein components serve in the solubilization of hydrophobic lipids and also 
contain cell-targeting signals. Lipoproteins include chylomicrons, chylomicron remnants, very-low- 
density lipoproteins (VLDL), intermediate-density lipoproteins (IDL), low-density lipoproteins (LDL), 

15 and high-density lipoproteins (HDL) (Meyers, supra ; Stryer, supra , pp. 69 1-702). There is a strong 
inverse correlation between the levels of plasma HDL and risk of premature coronary heart disease. 
ApoL is an HDL apolipoprotein expressed in the pancreas (Duchateau, P.N. et al. (1997) J. Biol. Chem. 
272:25576-25582). 

Most cells outside the liver and intestine take up cholesterol from the blood rather than 

20 synthesize it themselves. Cell surface LDL receptors bind LDL particles which are then internalized by 
endocytosis (Meyers, supra ). Absence of the LDL receptor, the cause of the disease familial 
hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to atherosclerosis 
(Stryer, supra , pp. 69 1 -702). 

Proteins involved in cholesterol uptake and biosynthesis are tightly regulated in response to 

25 cellular cholesterol levels. The sterol regulatory element binding protein (SREBP) is a sterol- 
responsive transcription factor. Under normal cholesterol conditions, SREBP resides in the 
endoplasmic reticulum membrane. When cholesterol levels are low, a regulated cleavage of SREBP 
occurs which releases the extracellular domain of the protein. This cleaved domain is then transported 
to the nucleus where it activates the transcription of the LDL receptor gene, and genes encoding 

30 enzymes of cholesterol synthesis, by binding the sterol regulatory element (SRE) upstream of the genes 
(Yang, J. etal. (1995) J. Biol. Chem. 270:12152-12161). Regulation of cholesterol uptake and 
biosynthesis also occurs via the oxysterol-binding protein (OSBP). Oxysterols are oxidation products 
formed during the catabolism of cholesterol, and are involved in regulation of steroid biosynthesis. 
OSBP is a high-affinity intracellular receptor for a variety of oxysterols that down-regulate cholesterol 

35 synthesis and stimulate cholesterol esterification (Lagace, T.A. et al. (1997) Biochem. J. 326:205-213). 
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The copines are phospholipid-binding proteins believed to function in membrane trafficking. 
Copines promote lipid vesicle aggregation. They contain a C2 domain associated with membrane 
activity and an annexin-type domain that mediates interactions between integral and extracellular 
proteins and is associated with calcium binding and regulation (Creutz, C.E. (1998) J. Biol. Chem. 
5 273:1393-1402). Other C2-containing proteins include the synaptotagmins, a family of proteins 

involved in vesicular trafficking. Synaptotagmin concentrations in cerebrospinal fluid have been found 
to be reduced in early-onset Alzheimer's disease (Gottfries, C.G. et al. (1998) J. Neural Transm. 
105:773-786). 

Lipids and their associated proteins have roles in human diseases and disorders. Increased 

10 synthesis of long-chain fatty acids occurs in neoplasms including those of the breast, prostate, ovary, 
colon and endometrium. There is a strong inverse correlation between the levels of plasma HDL and 
risk of premature coronary heart disease. Absence of the LDL receptor, the cause of familial 
hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to atherosclerosis 
(Stryer, supra , pp. 691-702). The arterial disease atherosclerosis is characterized by the formation of 

15 fatty lesions on the inside of the arterial wall. These lesions promote the loss of arterial flexibility and 
the formation of blood clots (Guyton, supra ). Oxysterols are present in human atherosclerotic plaques 
and believed to play an active role in plaque development (Brown, A.J. (1999) Atherosclerosis 142:1- 
28). Steatosis, or fatty liver, is characterized by the accumulation of triglycerides in the liver and may 
occur in association with a variety of conditions including alcoholism, diabetes, obesity, and prolonged 

20 parenteral nutrition. Steatosis may lead to fibrosis and cirrhosis of the liver. In Tay-Sachs disease, the 
GM 2 ganglioside (a sphingolipid) accumulates in lysosomes of the central nervous system due to a lack 
of the enzyme N-acetylhexosaminidase. Patients suffer nervous system degeneration leading to early 
death (Fauci, A.S. et al. (1998) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY 
p. 2171). The Niemann-Pick diseases are caused by defects in lipid metabolism. Niemann-Pick 

25 diseases types A and B are caused by accumulation of sphingomyelin (a sphingolipid) and other lipids 
in the central nervous system due to a defect in the enzyme sphingomyelinase, leading to 
neurodegeneration and lung disease. Niemann-Pick disease type C results from a defect in cholesterol 
transport, leading to the accumulation of sphingomyelin and cholesterol in lysosomes and a secondary 
reduction in sphingomyelinase activity. Neurological symptoms such as grand mal seizures, ataxia, and 

30 loss of previously learned speech, manifest 1-2 years after birth. A mutation in the NPC protein, which 
contains a putative cholesterol-sensing domain, was found in a mouse model of Niemann-Pick disease 
type C (Fauci, supra , p. 2175; Loftus, S.K. et al. (1997) Science 277:232-235). 

The discovery of new human lipid-associated proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 

35 and treatment of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid 
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metabolism. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human lipid-associated proteins, referred to 
5 collectively as "LIPAP" and individually as "LIPAP-1," "L1PAP-2," "LIPAP-3," "LIPAP-4," "LIPAP- 
5," "LIPAP-6," "LIPAP-7," "LIPAP-8," "LIPAP-9," "LIPAP-10," "LIPAP-1 1," and "LIPAP-12." In 
one aspect, the invention provides an isolated polypeptide comprising a) an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-12, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 

10 SEQ ID NO: 1-12, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-1 2, or d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-1 2. In one alternative, the invention provides an isolated 
polypeptide comprising the amino acid sequence of SEQ ID NO: 1-1 2. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising a) 

15 an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -12, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-12, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1 -1 2, or d) an immunogenic fragment 
of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-12. In one alternative, 

20 the polynucleotide is selected from the group consisting of SEQ ID NO: 1 3-24. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-12, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 

25 consisting of SEQ ID NO: 1-12, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-1 2, or d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-12. In one alternative, the invention 
provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention 
provides a transgenic organism comprising the recombinant polynucleotide. 

30 The invention also provides a method for producing a polypeptide comprising a) an amino acid 

sequence selected from the group consisting of SEQ ID NO: 1- 12, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-1 2, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-12, or d) an immunogenic fragment of an amino acid 

35 sequence selected from the group consisting of SEQ ID NO: 1-12. The method comprises a) culturing a 

4 
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cell under conditions suitable for expression of the polypeptide, wherein said cell is transformed with a 
recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
5 polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 - 
12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-12, c) a biologically active fragment 
of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-12, or d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1 - 
10 12. 

The invention further provides an isolated polynucleotide comprising a) a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 13-24, b) a naturally occurring 
polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 13-24, c) a polynucleotide sequence complementary to a), or 

15 d) a polynucleotide sequence complementary to b). In one alternative, the polynucleotide comprises at 
least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a) a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 13-24, b) a naturally occurring 

20 polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 13-24, c) a polynucleotide sequence complementary to a), or 
d) a polynucleotide sequence complementary to b). The method comprises a) hybridizing the sample 
with a probe comprising at least 16 contiguous nucleotides comprising a sequence complementary to 
said target polynucleotide in the sample, and which probe specifically hybridizes to said target 

25 polynucleotide, under conditions whereby a hybridization complex is formed between said probe and 
said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and 
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 30 contiguous 
nucleotides. In another alternative, the probe comprises at least 60 contiguous nucleotides. 

The invention further provides a pharmaceutical composition comprising an effective amount 

30 of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-12, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-1 2, or d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:l- 

35 12, and a pharmaceutically acceptable excipient. The invention additionally provides a method of 
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treating a disease or condition associated with decreased expression of functional LIPAP, comprising 
administering to a patient in need of such treatment the pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an agonist 
of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID 
5 NO: 1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-1 2, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-12, or d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1- 
12. The method comprises a) exposing a sample comprising the polypeptide to a compound, and b) 

10 detecting agonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an agonist compound identified by the method and a pharmaceutical^ 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional LIPAP, comprising administering to a 
patient in need of such treatment the pharmaceutical composition. 

15 Additionally, the invention provides a method for screening a compound for effectiveness as an 

antagonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-12, b) a naturally occurring amino acid sequence having at least 90% sequence identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-12, c) a biologically 
active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-1 2, or 

20 d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-12. The method comprises a) exposing a sample comprising the polypeptide to a compound, and 
b) detecting antagonist activity in the sample. In one alternative, the invention provides a 
pharmaceutical composition comprising an antagonist compound identified by the method and a 
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of 

25 treating a disease or condition associated with overexpression of functional LIPAP, comprising 
administering to a patient in need of such treatment the pharmaceutical composition. 

The invention further provides a method for screening a compound for effectiveness in altering 
expression of a target polynucleotide, wherein said target polynucleotide comprises a sequence selected 
from the group consisting of SEQ ID NO: 13-24, the method comprising a) exposing a sample 

30 comprising the target polynucleotide to a compound, and b) detecting altered expression of the target 
polynucleotide. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
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length sequences encoding LIPAP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of LIPAP. 

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression 
5 patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or 
conditions associated with these tissues; and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding LIPAP were isolated. 

Table 5 shows the tools, programs, and algorithms used to analyze LEPAP, along with 
10 applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 

that this invention is not limited to the particular machines, materials and methods described, as these 
15 may vary. It is also to be understood that the terminology used herein is for the purpose of describing 

particular embodiments only, and is not intended to limit the scope of the present invention which will 

be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 

and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
20 reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 

reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 

forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 

25 Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing the 
cell lines, protocols, reagents and vectors which are reported in the publications and which might be 
used in connection with the invention. Nothing herein is to be construed as an admission that the 

30 invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"LIPAP" refers to the amino acid sequences of substantially purified LIPAP obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 
35 The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 

7 
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LIPAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of LIPAP either by directly interacting with 
LIPAP or by acting on components of the biological pathway in which LIPAP participates. 

An "allelic variant" is an alternative form of the gene encoding LIPAP. Allelic variants may 

5 result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 

10 in a given sequence. 

"Altered" nucleic acid sequences encoding LIPAP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as LIPAP or a 
polypeptide with at least one functional characteristic of LIPAP. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 

15 the polynucleotide encoding LIPAP, and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding LIPAP. 
The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of 
amino acid residues which produce a silent change and result in a functionally equivalent LIPAP. 
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, 

20 solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the 
biological or immunological activity of LIPAP is retained. For example, negatively charged amino 
acids may include aspartic acid and glutamic acid, and positively charged amino acids may include 
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity 
values may include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged 

25 side chains having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine 
and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a naturally 

30 occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 
in the art. 

35 The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
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of LIPAP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of LIPAP either by 
directly interacting with LIPAP or by acting on components of the biological pathway in which LIPAP 
participates. 

5 The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 

such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind LIPAP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 

10 synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole 
limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

15 immunize a host animal, numerous regions of the protein may induce the production of antibodies 

which bind specifically to antigenic determinants (particular regions or three-dimensional structures on 
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used 
to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" strand 

20 of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; peptide nucleic 
acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, 
methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 2'- 
methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such as 5- 
methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense molecules may be produced 

25 by any method including chemical synthesis or transcription. Once introduced into a cell, the 
complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence 
produced by the cell to form duplexes which block either transcription or translation. The designation 
"negative" or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can 
refer to the sense strand of a reference DNA molecule. 

30 The term "biologically active" refers to a protein having structural, regulatory, or biochemical 

functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the 
capability of the natural, recombinant, or synthetic LIPAP, or of any oligopeptide thereof, to induce a 
specific immune response in appropriate animals or cells and to bind with specific antibodies. 
The terms "complementary" and "complementarity" refer to the natural binding of 

35 polynucleotides by base pairing. For example, the sequence "5' A-G-T 3"' bonds to the complementary 
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sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules may be "partial," 
such that only some of the nucleic acids bind, or it may be "complete," such that total complementarity 
exists between the single stranded molecules. The degree of complementarity between nucleic acid 
strands has significant effects on the efficiency and strength of the hybridization between the nucleic 
5 acid strands. This is of particular importance in amplification reactions, which depend upon binding 
between nucleic acid strands, and in the design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 

10 Compositions comprising polynucleotide sequences encoding LIPAP or fragments of LIPAP may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

15 "Consensus sequence" refers to a nucleic acid sequence which has been resequenced to resolve 

uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or the 3' 
direction, and resequenced, or which has been assembled from the overlapping sequences of one or 
more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer program for 
fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI). Some 

20 sequences have been both extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that, when made, least interfere 
with the properties of the original protein, i.e., the structure and especially the function of the protein is 
conserved and not significantly changed by such substitutions. The table below shows amino acids 
which may be substituted for an original amino acid in a protein and which are regarded as conservative 

25 amino acid substitutions. 



Original Residue 



Conservative Substitution 



35 



30 



Ala 

Arg 

Asn 

Asp 

Cys 

Gin 

Glu 

Gly 

His 

He 

Leu 

Lys 

Met 



Arg, Gin, Glu 
Leu, He 



Leu, Val 
He, Val 



Asn, Glu, His 
Asp, Gin, His 
Ala 

Asn, Arg, Gin, Glu 



Asn, Glu 
Ala, Ser 



Asp, Gin, His 



Gly, Ser 
His, Lys 
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5 



Phe 
Ser 
Thr 
Trp 
Tyr 
Val 



His, Phe, Trp 
He, Leu, Thr 



Cys, Thr 
Ser, Val 
Phe, Tyr 



His, Met, Leu, Trp, Tyr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
10 (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 



15 polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative 
polynucleotide encodes a polypeptide which retains at least one biological or immunological function 
of the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any 
similar process that retains at least one biological or immunological function of the polypeptide from 

20 which it was derived. 

A "fragment" is a unique portion of LIPAP or the polynucleotide encoding LIPAP which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up to 
the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 

25 used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 
20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid residues 
in length. Fragments may be preferentially selected from certain regions of a molecule. For example, 
a polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 
250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain defined sequence. 

30 Clearly these lengths are exemplary, and any length that is supported by the specification, including the 
Sequence Listing, tables, and figures, may be encompassed by the present embodiments. 

A fragment of SEQ ID NO: 13-24 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO: 13-24, for example, as distinct from any other sequence in the same 
genome. A fragment of SEQ ID NO: 13-24 is useful, for example, in hybridization and amplification 

35 technologies and in analogous methods that distinguish SEQ ID NO: 1 3-24 from related polynucleotide 
sequences. The precise length of a fragment of SEQ ID NO: 13-24 and the region of SEQ ID NO: 13-24 
to which the fragment corresponds are routinely determinable by one of ordinary skill in the art based 



The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 
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on the intended purpose for the fragment. 

A fragment of SEQ ID NO:l-12 is encoded by a fragment of SEQ ID NO:13-24. A fragment 
of SEQ ID NO: 1-1 2 comprises a region of unique amino acid sequence that specifically identifies SEQ 
ID NO: 1-12. For example, a fragment of SEQ ID NO: 1-12 is useful as an immunogenic peptide for the 
5 development of antibodies that specifically recognize SEQ ID NO: 1 - 1 2. The precise length of a 
fragment of SEQ ID NO: 1-1 2 and the region of SEQ ID NO: 1-12 to which the fragment corresponds 
are routinely determinable by one of ordinary skill in the art based on the intended purpose for the 
fragment. 

The term "similarity" refers to a degree of complementarity. There may be partial similarity or 
10 complete similarity. The word "identity" may substitute for the word "similarity." A partially 
complementary sequence that at least partially inhibits an identical sequence from hybridizing to a 
target nucleic acid is referred to as "substantially similar." The inhibition of hybridization of the 
completely complementary sequence to the target sequence may be examined using a hybridization 
assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced 
15 stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the 
binding of a completely similar (identical) sequence to the target sequence under conditions of reduced 
stringency. This is not to say that conditions of reduced stringency are such that non-specific binding is 
permitted, as reduced stringency conditions require that the binding of two sequences to one another be 
a specific (i.e., a selective) interaction. The absence of non-specific binding may be tested by the use 

20 of a second target sequence which lacks even a partial degree of complementarity (e.g., less than about 
30% similarity or identity). In the absence of non-specific binding, the substantially similar sequence 
or probe will not hybridize to the second non-complementary target sequence. 

The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 

25 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence 

30 alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5: 151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. For 
pairwise alignments of polynucleotide sequences, the default parameters are set as follows: Ktuple=2, 
gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected 

35 as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between aligned 
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polynucleotide sequence pairs. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from 
5 several sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
10 Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example: 
15 Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 
20 Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 

25 the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

30 Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 

35 the percentage of residue matches between at least two polypeptide sequences aligned using a 
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standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the hydrophobicity and acidity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 
5 Percent identity between polypeptide sequences may be determined using the default 

parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence 
alignment program (described and referenced above). For pairwise alignments of polypeptide 
sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l , gap penalty=3, 
window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default residue weight 
10 table. As with polynucleotide alignments, the percent identity is reported by CLUSTAL V as the 
"percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example: 
15 Matrix: BLOSUM62 

Open Gap: ] 1 and Extension Gap: 1 penalties 

Gap x drop -off: 50 

Expect: 10 

Word Size: 3 
20 Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
25 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
30 stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid sequence 
in the non-antigen binding regions has been altered so that the antibody more closely resembles a 
human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
35 complementary strand through base pairing under defined hybridization conditions. Specific 
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hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific 
hybridization complexes form under permissive annealing conditions and remain hybridized after the 
"washing" step(s). The washing step(s) is particularly important in determining the stringency of the 
hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding 
5 between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v) 
0 SDS, and about 100 ug/ml denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Generally, such wash temperatures are selected to be about 
5°C to 20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
5 target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 
for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989, Molecular 
Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 

20 include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0. 1 to 2 x SSC, with SDS being present at about 0. 1 %. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
denatured salmon sperm DNA at about 100-200 ug/ml. Organic solvent, such as formamide at a 

25 concentration of about 35-50% v/v, may also be used under particular circumstances, such as for 

RNA:DNA hybridizations. Useful variations on these wash conditions will be readily apparent to those 
of ordinary skill in the art. Hybridization, particularly under high stringency conditions, may be 
suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative of 
a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to 
which cells or their nucleic acids have been fixed). 
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The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
5 various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate. 
The terms "element" and "array element" in a microarray context, refer to hybridizable 
polynucleotides arranged on the surface of a substrate. 
10 The term "modulate" refers to a change in the activity of LIPAP. For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of LIPAP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
15 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
20 sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, 
where necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
25 preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Probe" refers to nucleic acid sequences encoding LIPAP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are isolated 
oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical labels 
30 include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are short 
nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 
35 Probes and primers as used in the present invention typically comprise at least 1 5 contiguous 
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nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
5 specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel et al.,1987, Current Proto cols in Molecular Biology. 
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A Guide 
10 to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs can be derived from a 
known sequence, for example, by using computer programs intended for that purpose such as Primer 
(Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
15 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
20 and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a "misprimmg library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides 
for microarrays. (The source code for the latter two primer selection programs may also be obtained 
25 from their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
30 polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

35 A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
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that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
5 been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
10 expressed, inducing a protective immunological response in the mammal. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

15 The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 

acids encoding LIPAP, or fragments thereof, or LIPAP itself, may comprise a bodily fluid; an extract 
from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 

20 protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide containing the epitope 
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 

25 reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

30 A "substitution" refers to the replacement of one or more amino acids or nucleotides by 

different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 

35 trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

18 



WO 00/49043 



PCTYUS00/04160 



"Transformation" describes a process by which exogenous DNA enters and changes a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host ceil. The method for transformation is selected based on the type 
5 of host cell being transformed and may include, but is not limited to, viral infection, electroporation, 
heat shock, lipofection, and particle bombardment. The term "transformed" cells includes stably 
transformed cells in which the inserted DNA is capable of replication either as an autonomously 
replicating plasmid or as part of the host chromosome, as well as transiently transformed cells which 
express the inserted DNA or RNA for limited periods of time. 

10 A "transgenic organism," as used herein, is any organism, including but not limited to animals 

and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
introduced by way of human intervention, such as by transgenic techniques well known in the art. The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 

15 recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro 
fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The transgenic 
organisms contemplated in accordance with the present invention include bacteria, cyanobacteria, 
fungi, and plants and animals. The isolated DNA of the present invention can be introduced into the 
host by methods known in the art, for example infection, transfection, transformation or 

20 transconjugation. Techniques for transferring the DNA of the present invention into such organisms 
are widely known and provided in references such as Sambrook et al. (1989), supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 

25 set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 

30 polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 

polypeptide may possess additional functional domains or lack domains that are present in the reference 
molecule. Species variants are polynucleotide sequences that vary from one species to another. The 
resulting polypeptides generally will have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 

35 individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
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polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 
5 least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 807c, at least 90%, at least 95%, or at least 98% or greater sequence identity 
over a certain defined length of one of the polypeptides. 
10 THE INVENTION 

The invention is based on the discovery of new human lipid-associated proteins (LIPAP), the 
polynucleotides encoding LIPAP, and the use of these compositions for the diagnosis, treatment, or 
. prevention of cardiovascular, neurological, and gastrointestinal disorders, and disorders of lipid 
metabolism. 

15 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 

LIPAP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 
nucleic acids encoding each LIPAP were identified, and column 4 shows the cDNA libraries from 
which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA 

20 libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries. 
The Incyte clones in column 5 were used to assemble the consensus nucleotide sequence of each LIPAP 
and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 

25 polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis and the identity of each polypeptide; 
and column 7 shows analytical methods and in some cases, searchable databases to which the analytical 
methods were applied. The methods of column 7 were used to characterize each polypeptide through 

30 sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding LIPAP. The first column of Table 3 lists the nucleotide 
SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1 . These fragments are 
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO: 1 3-24 and to 

35 distinguish between SEQ ID NO: 13-24 and related polynucleotide sequences. The polypeptides 
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encoded by these fragments are useful, for example, as immunogenic peptides. Column 3 lists tissue 
categories which express LIPAP as a fraction of total tissues expressing LIPAP. Column 4 lists 
diseases, disorders, or conditions associated with those tissues expressing LIPAP as a fraction of total 
tissues expressing LIPAP. Column 5 lists the vectors used to subclone each cDNA library. 
5 The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 

from which cDNA clones encoding LIPAP were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ID NO:2 1 maps to chromosome 1 1 within the interval from 92.5 to 96.3 centiMorgans . 
10 This interval also contains a gene encoding a G-protein coupled receptor associated with epilepsy. 

The invention also encompasses LIPAP variants. A preferred LIPAP variant is one which has 
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the LIPAP amino acid sequence, and which contains at least one functional or structural 
characteristic of LEPAP. 

15 The invention also encompasses polynucleotides which encode LIPAP. In a particular 

embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO: 13-24, which encodes LIPAP. The polynucleotide sequences 
of SEQ ID NO: 13-24, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 

20 is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding LIPAP. In 
particular, such a variant polynucleotide sequence will have at least about 80%, or alternatively at least 
about 90%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding LIPAP. A particular aspect of the invention encompasses a variant of a polynucleotide 

25 sequence comprising a sequence selected from the group consisting of SEQ ID NO: 13-24 which has at 
least about 80%, or alternatively at least about 90%, or even at least about 95% polynucleotide 
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO: 13-24. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of LIPAP. 

30 It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 

code, a multitude of polynucleotide sequences encoding LIPAP, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be 
made by selecting combinations based on possible codon choices. These combinations are made in 

35 accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 
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occurring LIPAP, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode LIPAP and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring LIPAP under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding LIPAP or 

5 its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding LIPAP and its derivatives without altering the encoded amino acid sequences 

10 include the production of RNA transcripts having more desirable properties, such as a greater half-life, 
than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode LIPAP and 
LIPAP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 

15 using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding LIPAP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO: 13-24 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 

20 S.L. Berger (1987) Methods Enzymol. 152:399-407: Kimmel, A.R. (1987) Methods Enzymol. 152:507- 
511.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the invention. The methods may employ such enzymes as the Klcnow fragment of 

25 DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Perkin-Elmer), 
thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or combinations of 
polymerases and proofreading exonucleases such as those found in the ELONGASE amplification 
system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is automated with 
machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal 

30 cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Perkin-Elmer). 
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Perkin- 
Elmer), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA), or 
other systems known in the art. The resulting sequences are analyzed using a variety of algorithms 
which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular 

35 Biology . John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and 
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Biotechnology , Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding LIPAP may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
5 restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) Another 
method, inverse PCR, uses primers that extend in divergent directions to amplify unknown sequence 
from a circularized template. The template is derived from restriction fragments comprising a known 
genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 
10 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent to 
known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-1 19.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
15 are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 

Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, 
20 Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 
25 does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
30 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process 
from loading of samples to computer analysis and electronic data display may be computer controlled. 
Capillary electrophoresis is especially preferable for sequencing small DNA fragments which may be 
35 present in limited amounts in a particular sample. 
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In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode LIPAP may be cloned in recombinant DNA molecules that direct expression of LIPAP, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 
of the genetic code, other DNA sequences which encode substantially the same or a functionally 
5 equivalent amino acid sequence may be produced and used to express LIPAP. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter LIPAP-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 

10 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such as 
MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 

15 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve 
the biological properties of LIPAP, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 

20 selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 

25 fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

In another embodiment, sequences encoding LIPAP may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 

30 Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
LIPAP itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solid-phase techniques. (See, e.g., Roberge, J.Y. et al. (1995) 
Science 269:202-204.) Automated synthesis may be achieved using the ABI 431 A peptide synthesizer 
(Perkin-Elmer). Additionally, the amino acid sequence of LIPAP, or any part thereof, may be altered 

35 during direct synthesis and/or combined with sequences from other proteins, or any part thereof, to 
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produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
5 (See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties , WH Freeman, New York 
NY.) 

In order to express a biologically active LIPAP, the nucleotide sequences encoding LIPAP or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 

10 suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3 'untranslated regions in the vector and in polynucleotide sequences 
encoding LIPAP. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding LIPAP. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 

15 sequences encoding LIPAP and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals including an in-frame ATG initiation codon should be provided by the 
vector. Exogenous translational elements and initiation codons may be of various origins, both natural 

20 and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate 
for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding LIPAP and appropriate transcriptional and translational control 

25 elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular C loning. A Laboratory 
Manual , Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 

30 encoding LIPAP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 

35 animal cell systems. The invention is not limited by the host cell employed. 
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In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding LIPAP. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding LIPAP can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
5 (Life Technologies). Ligation of sequences encoding LIPAP into the vector's multiple cloning site 
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 
deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 

10 264:5503-5509.) When large quantities of LIPAP are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of LIPAP may be used. For example, vectors containing the 
strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of LIPAP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 

15 promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra ; 
Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, CA. et al. (1994) 
Bio/Technology 12:181-184.) 

20 Plant systems may also be used for expression of LIPAP. Transcription of sequences encoding 

LIPAP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 

25 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) These constructs can 
be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, 
e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
30 where an adenovirus is used as an expression vector, sequences encoding LIPAP may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses LIPAP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
35 virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
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based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 

5 or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
LIPAP in cell lines is preferred. For example, sequences encoding LIPAP can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 

10 introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

15 Any number of selection systems may be used to recover transformed cell lines. These include, 

but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tk~ and apr cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 1 1:223-232; 
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 

20 resistance to the aminoglycosides neomycin and G-41 8; and als and pat confer resistance to 

chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) 
Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 

25 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), 6 
glucuronidase and its substrate 6-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. 
(1995) Methods Mol. Biol. 55:121-131.) 

30 Although the presence/absence of marker gene expression suggests that the gene of interest is 

also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding LIPAP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding LIPAP can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding LIPAP under the control of a single 

35 promoter. Expression of the marker gene in response to induction or selection usually indicates 
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expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding LIPAP and that express 
LIPAP may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 

5 amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of LIPAP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 

10 activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on LIPAP is preferred, but a competitive binding 
assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual , APS Press, St. Paul MN, Sect. IV; Coligan, J.E. 
et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley-Interscience, New 

15 York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding LIPAP include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 

20 Alternatively, the sequences encoding LIPAP, or any fragments thereof, may be cloned into a vector for 
the production of an mRNA probe. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as 
T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 

25 (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease 
of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding LIPAP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 

30 produced by a transformed cell may be secreted or retained intracellular^ depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode LIPAP may be designed to contain signal sequences which direct 
secretion of LIPAP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
35 inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
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polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
5 (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and 
processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding LIPAP may be ligated to a heterologous sequence resulting in translation of a 

10 fusion protein in any of the aforementioned host systems. For example, a chimeric LIPAP protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of LIPAP activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 

15 maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c- 
myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate 
fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate 
resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of 
fusion proteins using commercially available monoclonal and polyclonal antibodies that specifically 

20 recognize these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage 
site located between the LIPAP encoding sequence and the heterologous protein sequence, so that 
LIPAP may be cleaved away from the heterologous moiety following purification. Methods for fusion 
protein expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of 
commercially available kits may also be used to facilitate expression and purification of fusion 

25 proteins. 

In a further embodiment of the invention, synthesis of radiolabeled LIPAP may be achieved in 
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, 
or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 

30 example, 35 S-methionine. 

Fragments of LIPAP may be produced not only by recombinant means, but also by direct 
peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein 
synthesis may be performed by manual techniques or by automation. Automated synthesis may be 
achieved, for example, using the ABI 431 A peptide synthesizer (Perkin-Elmer). Various fragments of 

35 LIPAP may be synthesized separately and then combined to produce the full length molecule. 
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THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of LIPAP and human lipid-associated proteins. In addition, the expression of LIPAP is closely 
associated with cardiovascular and gastrointestinal tissues, and tissues of the nervous system. 
5 Therefore, LIPAP appears to play a role in cardiovascular, neurological, and gastrointestinal disorders, 
and disorders of lipid metabolism. In the treatment of disorders associated with increased LIPAP 
expression or activity, it is desirable to decrease the expression or activity of LIPAP. In the treatment 
of disorders associated with decreased LIPAP expression or activity, it is desirable to increase the 
expression or activity of LIPAP. 
10 Therefore, in one embodiment, LIPAP or a fragment or derivative thereof may be administered 

to a subject to treat or prevent a disorder associated with decreased expression or activity of LIPAP. 
Examples of such disorders include, but are not limited to, a cardiovascular disorder including blood 
vessel disorders such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's 
disease, aneurysms, arterial dissections, varicose veins, thrombophlebitis and phlebothrombosis, 
15 vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular replacement, and 
coronary artery bypass graft surgery; heart disorders such as congestive heart failure, ischemic heart 
disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular heart 
disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral annular calcification, 
mitral valve prolapse, rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial 
20 thrombotic endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, 
cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and 
complications of cardiac transplantation; and lung disorders such as congenital lung anomalies, 
atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary hemorrhage, 
pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary disease, 
25 restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic bronchitis, 
bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, lung 
abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, idiopathic 
pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, pulmonary 
eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary hemorrhage syndromes, 
30 Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary involvement in 
collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors, inflammatory and 
noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, 
radiation-induced lung disease, and complications of lung transplantation; a neurological disorder such 
as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's 
35 disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
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amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 
retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion diseases including 
5 kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome; fatal familial 
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous 
sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorders of the central nervous system, cerebral palsy, neuroskeletal 
disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular 
10 dystrophy and other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis 
and polymyositis; inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic 
paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; seasonal affective 
disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, 
paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, 
15 corticobasal degeneration, and familial frontotemporal dementia; a gastrointestinal disorder such as 
dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal carcinoma, 
dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral or 
pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the 
intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, 
20 biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, 
infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease, Mallory- 
Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel syndrome, short bowel 
syndrome, diarrhea, constipation, gastrointestinal hemorrhage, acquired immunodeficiency syndrome 
(AIDS) enteropathy, jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic steatosis, 
25 hemochromatosis, Wilson's disease, alpha,-antitrypsin deficiency, Reye's syndrome, primary 

sclerosing cholangitis, liver infarction, portal vein obstruction and thrombosis, centrilobular necrosis, 
peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty 
liver of pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including nodular 
hyperplasias, adenomas, and carcinomas; and a disorder of lipid metabolism such as fatty liver, 
30 cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase deficiency, 
myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's disease, 
Gaucher' s disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, 
GM 2 gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier disease, 
hyperlipoproteinemia, diabetes mellitus, lipodystrophy, lipomatoses, acute panniculitis, disseminated 
35 fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, 



WO 00/49043 



PO7US00/04160 



atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary 
hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithinxholesterol 
acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolerma, Tay- 
Sachs disease, Sandhoff's disease, hyperlipidemia, hyperlipemia, lipid myopathies, and obesity. 
5 In another embodiment, a vector capable of expressing LIPAP or a fragment or derivative 

thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of LIPAP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
LIPAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat 
10 or prevent a disorder associated with decreased expression or activity of LIPAP including, but not 
limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of LIPAP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of LIPAP including, but not limited to, those listed above. 
15 In a further embodiment, an antagonist of LIPAP may be administered to a subject to treat or 

prevent a disorder associated with increased expression or activity of LIPAP. Examples of such 
disorders include, but are not limited to, those cardiovascular, neurological, and gastrointestinal 
disorders, and disorders of lipid metabolism, described above. In one aspect, an antibody which 
specifically binds LIPAP may be used directly as an antagonist or indirectly as a targeting or delivery 
20 mechanism for bringing a pharmaceutical agent to cells or tissues which express LIPAP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding LIPAP may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of LIPAP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
25 sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
30 lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of LIPAP may be produced using methods which are generally known in the art. 
In particular, purified LIPAP may be used to produce antibodies or to screen libraries of pharmaceutical 
agents to identify those which specifically bind LIPAP. Antibodies to LIPAP may also be generated 
using methods that are well known in the art. Such antibodies may include, but are not limited to, 
35 polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced 
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by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are 
generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and 
others may be immunized by injection with LIPAP or with any fragment or oligopeptide thereof which 

5 has immunogenic properties. Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as 
aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli 
Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

10 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to LIPAP 

have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
identical to a portion of the amino acid sequence of the natural protein and contain the entire ammo 
acid sequence of a small, naturally occurring molecule. Short stretches of LIPAP amino acids may be 

15 fused with those of another protein, such as KLH, and antibodies to the chimeric molecule may be 
produced. 

Monoclonal antibodies to LIPAP may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
20 technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
25 antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, S. 
et al. (1985) Nature 3 14:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce LIPAP-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
30 generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, 
D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, G 
35 et al. (1991) Nature 349:293-299.) 
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Antibody fragments which contain specific binding sites for LIPAP may also be generated. For 
example, such fragments include, but are not limited to, ¥(ab\ fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab)2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
5 easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et 
al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 

10 immunoassays typically involve the measurement of complex formation between LIPAP and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering LIPAP epitopes is generally used, but a competitive binding assay may 
also be employed (Pound, supra ). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 

15 may be used to assess the affinity of antibodies for LIPAP. Affinity is expressed as an association 
constant, K a , which is defined as the molar concentration of LIPAP-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The K a 
determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for 
multiple LIPAP epitopes, represents the average affinity, or avidity, of the antibodies for LIPAP. The 

20 K a determined for a preparation of monoclonal antibodies, which are monospecific for a particular 
LIPAP epitope, represents a true measure of affinity. High-affinity antibody preparations with K a 
ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays in which the LIPAP- 
antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with K, 
ranging from about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar 

25 procedures which ultimately require dissociation of LIPAP, preferably in active form, from the 

antibody (Catty, D. (1988) Antibodies, Volume I: A Practical Approach , IRL Press, Washington, DC; 
Liddell, J.E. and Cryer, A. (1991) A Practical Guide to Monoclonal Antibodies , John Wiley & Sons, 
New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
30 the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of LIPAP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and 
35 Coligan et al. supra .) 
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In another embodiment of the invention, the polynucleotides encoding LIPAP, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, the complement of the 
polynucleotide encoding LIPAP may be used in situations in which it would be desirable to block the 
transcription of the mRNA. In particular, cells may be transformed with sequences complementary to 
5 polynucleotides encoding LIPAP. Thus, complementary molecules or fragments may be used to 
modulate LIPAP activity, or to achieve regulation of gene function. Such technology is now well 
known in the art, and sense or antisense oligonucleotides or larger fragments can be designed from 
various locations along the coding or control regions of sequences encoding LIPAP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or 
10 from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, 
tissue, or cell population. Methods which are well known to those skilled in the art can be used to 
construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding 
LIPAP. (See, e.g., Sambrook, supra ; Ausubel, 1995, supra .) 

Genes encoding LIPAP can be turned off by transforming a cell or tissue with expression 
15 vectors which express high levels of a polynucleotide, or fragment thereof, encoding LIPAP. Such 
constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in the 
absence of integration into the DNA, such vectors may continue to transcribe RNA molecules until they 
are disabled by endogenous nucleases. Transient expression may last for a month or more with a 
non-replicating vector, and may last even longer if appropriate replication elements are part of the 
20 vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5\ or regulatory 
regions of the gene encoding LIPAP. Oligonucleotides derived from the transcription initiation site, 
e.g., between about positions -10 and +10 from the start site, may be employed. Similarly, inhibition 

25 can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it 
causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber. B.E. and B.I. Carr, 
Molecular and Immunologic Approaches , Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 

30 complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 

35 engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
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endonucleolytic cleavage of sequences encoding LIPAP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 

5 corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 

10 any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding LIPAP. Such DNA sequences may be incorporated into a wide variety of vectors 
with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 

15 synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or 
tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends 
of the molecule, or the use of phosphorothioate or 2'O-methyl rather than phosphodiesterase linkages 

20 within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally suitable for 

25 use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

30 Any of the therapeutic methods described above may be applied to any subject in need of such 

therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical or 
sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the 
35 therapeutic effects discussed above. Such pharmaceutical compositions may consist of LIPAP, 
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antibodies to LPAP, and mimetics, agonists, antagonists, or inhibitors of LIPAP. The compositions 
may be administered alone or in combination with at least one other agent, such as a stabilizing 
compound, which may be administered in any sterile, biocompatible pharmaceutical carrier including, 
but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered 
5 to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any number 
of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 
10 In addition to the active ingredients, these pharmaceutical compositions may contain suitable 

pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing 
of the active compounds into preparations which can be used pharmaceutically. Further details on 
techniques for formulation and administration may be found in the latest edition of Remington^ 
Pharmaceutical Sciences (Maack Publishing, Easton PA). 
15 Pharmaceutical compositions for oral administration can be formulated using pharmaceutically 

acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers 
enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, 
gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active compounds 
20 with solid excipient and processing the resultant mixture of granules (optionally, after grinding) to 

obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable excipients include 
carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, and sorbitol; starch 
from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and 
25 tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents 
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, 
such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar 
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene 
30 glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to 
characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
35 Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
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starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, 
the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or 
liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
5 aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's 
solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances 
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, or 
10 synthetic fatty acid esters, such as ethyl oleatc, triglycerides, or liposomes. Non-lipid polycationic 
amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable 
stabilizers or agents to increase the solubility of the compounds and allow for the preparation of highly 
concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
15 permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a manner 
that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, 
levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many acids, 
20 including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acids. 
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free base 
forms. In other cases, the preparation may be a lyophilized powder which may contain any or all of the 
following: 1 mM to 50 mM histidine, 0. 1 % to 2% sucrose, and 2% to 7% mannitol, at a pH range of 4.5 
to 5.5, that is combined with buffer prior to use. 
25 After pharmaceutical compositions have been prepared, they can be placed in an appropriate 

container and labeled for treatment of an indicated condition. For administration of LIPAP, such 
labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
30 determination of an effective dose is well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs. 
An animal model may also be used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for 
35 administration in humans. 
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A therapeutically effective dose refers to that amount of active ingredient, for example LIPAP 
or fragments thereof, antibodies of LIPAP, and agonists, antagonists or inhibitors of LIPAP, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
5 calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD 50 /ED 50 ratio. Pharmaceutical compositions which 
exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in such 

10 compositions is preferably within a range of circulating concentrations that includes the ED 50 with little 
or no toxicity. The dosage varies within this range depending upon the dosage form employed, the 
sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the subject 
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 

15 moiety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. 
Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

20 Normal dosage amounts may vary from about 0.1 //g to 100,000 /ug, up to a total dose of about 

1 gram, depending upon the route of administration. Guidance as to particular dosages and methods of 
delivery is provided in the literature and generally available to practitioners in the art. Those skilled in 
the art will employ different formulations for nucleotides than for proteins or their inhibitors. 
Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, conditions, 

25 locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind LIPAP may be used for the 
diagnosis of disorders characterized by expression of LIPAP, or in assays to monitor patients being 
treated with LIPAP or agonists, antagonists, or inhibitors of LIPAP. Antibodies useful for diagnostic 

30 purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for LIPAP include methods which utilize the antibody and a label to detect LIPAP in human body 
fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and 
may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

35 A variety of protocols for measuring LIPAP, including ELISAs, RIAs, and FACS, are known in 
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the art and provide a basis for diagnosing altered or abnormal levels of LIPAP expression. Normal or 
standard values for LIPAP expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibody to LIPAP under 
conditions suitable for complex formation. The amount of standard complex formation may be 
5 quantitated by various methods, such as photometric means. Quantities of LIPAP expressed in subject, 
control, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding LIPAP may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 

10 complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of LIPAP may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
LIPAP, and to monitor regulation of LIPAP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 

15 sequences, including genomic sequences, encoding LIPAP or closely related molecules may be used to 
identify nucleic acid sequences which encode LIPAP. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding LIPAP, allelic variants, or related 

20 sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the LIPAP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO: 13-24 or from 
genomic sequences including promoters, enhancers, and introns of the LIPAP gene. 

25 Means for producing specific hybridization probes for DNAs encoding LIPAP include the 

cloning of polynucleotide sequences encoding LIPAP or LIPAP derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety 

30 of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, such as 
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding LIPAP may be used for the diagnosis of disorders 
associated with expression of LIPAP. Examples of such disorders include, but are not limited to, a 
cardiovascular disorder including blood vessel disorders such as arteriovenous fistula, atherosclerosis, 

35 hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, 
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thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon 
angioplasty, vascular replacement, and coronary artery bypass graft surgery; heart disorders such as 
congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, hypertensive 
heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid 
5 aortic valve, mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart 
disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation; and lung disorders such 
as congenital lung anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, 
10 pulmonary hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive 
pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, 
chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal 
pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, 
sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity 
15 pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary 
hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary 
involvement in collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors, inflammatory 
and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, 
radiation-induced lung disease, and complications of lung transplantation; a neurological disorder such 

20 as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 
retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 

25 thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion diseases including 
kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome; fatal familial 
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous 
sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorders of the central nervous system, cerebral palsy, neuroskeletal 

30 disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular 
dystrophy and other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis 
and polymyositis; inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic 
paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; seasonal affective 
disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, 

35 paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, 
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corticobasal degeneration, and familial frontotemporal dementia; a gastrointestinal disorder such as 
dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal carcinoma, 
dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral or 
pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the 
5 intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, 
biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, 
infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease, Mallory- 
Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel syndrome, short bowel 
syndrome, diarrhea, constipation, gastrointestinal hemorrhage, acquired immunodeficiency syndrome 

10 (AIDS) enteropathy, jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic steatosis, 
hemochromatosis, Wilson's disease, alpha r antitrypsin deficiency, Reye's syndrome, primary 
sclerosing cholangitis, liver infarction, portal vein obstruction and thrombosis, centrilobular necrosis, 
peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty 
liver of pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including nodular 

15 hyperplasias, adenomas, and carcinomas; and a disorder of lipid metabolism such as fatty liver, 
cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase deficiency, 
myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's disease, 
Gaucher's disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, 
GM 2 gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier disease, 

20 hyperlipoproteinemia, diabetes mellitus, lipodystrophy, lipomatoses, acute panniculitis, disseminated 
fat necrosis, adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, 
atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary 
hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithinxholesterol 
acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay- 

25 Sachs disease, Sandhoff's disease, hyperlipidemia, hyperlipemia, lipid myopathies, and obesity. The 
polynucleotide sequences encoding LIPAP may be used in Southern or northern analysis, dot blot, or 
other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered LIPAP expression. 
Such qualitative or quantitative methods are well known in the art. 

30 In a particular aspect, the nucleotide sequences encoding LIPAP may be useful in assays that 

detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding LIPAP may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard 

35 value. If the amount of signal in the patient sample is significantly altered in comparison to a control 
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sample then the presence of altered levels of nucleotide sequences encoding LIPAP in the sample 
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy 
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the 
treatment of an individual patient. 
5 In order to provide a basis for the diagnosis of a disorder associated with expression of LIPAP, 

a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding LIPAP, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by comparing the values obtained from normal subjects with 

10 values from an experiment in which a known amount of a substantially purified polynucleotide is used. 
Standard values obtained in this manner may be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 

15 hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
20 overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

25 Additional diagnostic uses for oligonucleotides designed from the sequences encoding LIPAP 

may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide 
encoding LEPAP, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
LIPAP, and will be employed under optimized conditions for identification of a specific gene or 

30 condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

Methods which may also be used to quantify the expression of LIPAP include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 

35 al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
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accelerated by running the assay in a high-throughput format where the oligomer of interest is 
presented in various dilutions and a spectrophotometric or coiorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
5 polynucleotide sequences described herein may be used as targets in a microarray. The microarray can 
be used to monitor the expression level of large numbers of genes simultaneously and to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, and to develop and 
monitor the activities of therapeutic agents. 

10 Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 

Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

15 In another embodiment of the invention, nucleic acid sequences encoding LIPAP may be used 

to generate hybridization probes useful in mapping the naturally occurring genomic sequence. The 
sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or to 
artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial 
chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI constructions, or single 

20 chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. ( 1997) Nat. Genet. 15:345-355; Price, 
CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 
mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 
965-968.) Examples of genetic map data can be found in various scientific journals or at the Online 

25 Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the location of the 
gene encoding LIPAP on a physical chromosomal map and a specific disorder, or a predisposition to a 
specific disorder, may help define the region of DNA associated with that disorder. The nucleotide 
sequences of the invention may be used to detect differences in gene sequences among normal, carrier, 
and affected individuals. 

30 In situ hybridization of chromosomal preparations and physical mapping techniques, such as 

linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the number or arm of a particular human chromosome is not known. 
New sequences can be assigned to chromosomal arms by physical mapping. This provides valuable 

35 information to investigators searching for disease genes using positional cloning or other gene 
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discovery techniques. Once the disease or syndrome has been crudely localized by genetic linkage to a 
particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences mapping to that area 
may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R.A. et al. 
(1988) Nature 336:577-580.) The nucleotide sequence of the subject invention may also be used to 

5 detect differences in the chromosomal location due to translocation, inversion, etc., among normal, 
carrier, or affected individuals. 

In another embodiment of the invention, LIPAP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 

10 solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between LIPAP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 

15 synthesized on a solid substrate. The test compounds are reacted with LIPAP, or fragments thereof, 
and washed. Bound LIPAP is then detected by methods well known in the art. Purified LIPAP can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

20 In another embodiment, one may use competitive drug screening assays in which neutralizing 

antibodies capable of binding LIPAP specifically compete with a test compound for binding LIPAP. In 

this manner, antibodies can be used to detect the presence of any peptide which shares one or more 

antigenic determinants with LIPAP. 

In additional embodiments, the nucleotide sequences which encode LEPAP may be used in any 
25 molecular biology techniques that have yet to be developed, provided the new techniques rely on 

properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 

description, utilize the present invention to its fullest extent. The following preferred specific 
30 embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 

of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 

particular U.S. Ser. No. 60/120,703 and U.S. Ser. No. 60/142,762, are hereby expressly incorporated by 

reference. 

35 
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EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed 
5 in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA purity. 

10 In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated using 
oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

15 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5. 1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 

20 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL SI 000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography 
(Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs were ligated into 
compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT 

25 plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo 
Alto CA). Recombinant plasmids were transformed into competent E. coli cells including XL 1 -Blue, 
XLl-BlueMRF, or SOLR from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life 
Technologies. 

II. Isolation of cDNA Clones 

30 Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system 

(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a Magic or 
WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification kit (Edge 
Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8 
Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN. 

35 Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or 
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without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 

5 well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

cDNA sequencing reactions were processed using standard methods or high-throughput 

10 instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200 
thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or 
the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 

15 Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with standard ABI protocols 
and base calling software; or other sequence analysis systems known in the art. Reading frames within 
the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, unit 

20 7.7). Some of the cDNA sequences were selected for extension using the techniques disclosed in 
Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 

25 references, and threshold parameters. The first column of Table 5 shows the tools, programs, and 
algorithms used, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 

30 two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 

Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which 
also calculates the percent identity between aligned sequences. 

35 The polynucleotide sequences were validated by removing vector, linker, and polyA sequences 
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and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 
programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a 
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 

5 using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full 
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were 
screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full 
length polynucleotide sequences were translated to derive the corresponding full length amino acid 
sequences, and these full length sequences were subsequently analyzed by querying against databases 

10 such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, 
Prosite, and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a 
probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., Eddy, 
S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 

15 amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO: 13-24. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

20 and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, supra , 
ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This 
25 analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 
computer search can be modified to determine whether any particular match is categorized as exact or 
similar. The basis of the search is the product score, which is defined as: 

% sequence identity x % maximum BLAST score 
100 

30 The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will be exact within 
a 1 % to 2% error, and, with a product score of 70, the match will be exact. Similar molecules are 
usually identified by selecting those which show product scores between 15 and 40, although lower 
scores may identify related molecules. 

35 The results of northern analyses are reported as a percentage distribution of libraries in which 
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the transcript encoding LIPAP occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
5 cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 
3. 

V. Chromosomal Mapping of LIPAP Encoding Polynucleotides 

10 The cDNA sequences which were used to assemble SEQ ID NO: 19-24 were compared with 

sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO: 19-24 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 

15 from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences had 
been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment of all 
sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

The genetic map location of SEQ ID NO:21 is described in The Invention as a range, or 

20 interval, of human chromosome 1 1 . The map position of an interval, in centiMorgans, is measured 
relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement 
based on recombination frequencies between chromosomal markers. On average, 1 cM is roughly 
equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to hot and cold 
spots of recombination.) The cM distances are based on genetic markers mapped by Genethon which 

25 provide boundaries for radiation hybrid markers whose sequences were included in each of the clusters. 
Human genome maps and other resources available to the public, such as the NCBI "GeneMap'99" 
World Wide Web site (http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if 
previously identified disease genes map within or in proximity to the intervals indicated above. 

VI. Extension of LIPAP Encoding Polynucleotides 

30 The full length nucleic acid sequences of SEQ ID NO: 1 3-24 were produced by extension of an 

appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 

35 in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
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temperatures of about 68 °C to about 72°C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed. 
5 High fidelity amplification was obtained by PCR using methods well known in the art. PCR 

was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 

10 pair PCI A and PCI B: Step 1 : 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

15 The concentration of DNA in each well was determined by dispensing 100 ul PICOGREEN 

quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 pi of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 

20 concentration of DNA. A 5 [A to 10 ju.] aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 

25 shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 

agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 

30 media, individual colonies were picked and cultured overnight at 37 °C in 384-well plates in LB/2x carb 
liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: 
Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, 
35 and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by 
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PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries were 
reamplified using the same conditions as described above. Samples were diluted with 20% 
dimethysulfoxide ( 1 :2, v/v), and sequenced using DYEN AMIC energy transfer sequencing primers and 
the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator 
5 cycle sequencing ready reaction kit (Perkin-Elmer). 

In like manner, the nucleotide sequences of SEQ ID NO: 13-24 are used to obtain 5 'regulatory 
sequences using the procedure above, oligonucleotides designed for such extension, and an appropriate 
genomic library. 

VII. Labeling and Use of Individual Hybridization Probes 

10 Hybridization probes derived from SEQ ID NO: 1 3-24 are employed to screen cDNAs, 

genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /iCi of [y- 32 P] adenosine 

15 triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 

20 Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours 
at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature under 
conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 

25 Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize array elements 
on the surface of a substrate. (See, e.g., Baldeschweiler, supra .) An array analogous to a dot or slot 

30 blot may also be used to arrange and link elements to the surface of a substrate using thermal, UV, 
chemical, or mechanical bonding procedures. A typical array may be produced by hand or using 
available methods and machines and contain any appropriate number of elements. After hybridization, 
nonhybridized probes are removed and a scanner used to determine the levels and patterns of 
fluorescence. The degree of complementarity and the relative abundance of each probe which 

35 hybridizes to an element on the microarray may be assessed through analysis of the scanned images. 



WO 00/49043 



PCT/USOO/04160 



Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise the 
elements of the microarray. Fragments suitable for hybridization can be selected using software well 
known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or 
fragments thereof corresponding to one of the nucleotide sequences of the present invention, or selected 

5 at random from a cDNA library relevant to the present invention, are arranged on an appropriate 
substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV cross-linking followed by 
thermal and chemical treatments and subsequent drying. (See, e.g., Schena, M. et al. (1995) Science 
270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) Fluorescent probes are prepared and 
used for hybridization to the elements on the substrate. The substrate is analyzed by procedures 

10 described above. 

IX. Complementary Polynucleotides 

Sequences complementary to the LTPAP-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring LIPAP. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 

15 smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of LEPAP. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the LIPAP-encoding transcript. 

20 X. Expression of LIPAP 

Expression and purification of LIPAP is achieved using bacterial or virus-based expression 
systems. For expression of LIPAP in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 

25 promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express LIPAP upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of LIPAP in eukaryotic cells is achieved by infecting insect 
or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 

30 (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding LIPAP by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 

35 Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et 
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al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937- 
1945.) 

In most expression systems, LIPAP is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 

5 affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma japonicum , enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from LIPAP at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 

10 using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra, ch. 
10 and 16). Purified LIPAP obtained by these methods can be used directly in the following activity 
assay. 

15 XI. Demonstration of LIPAP Activity 

Selected candidate lipid molecules, such as C4 sterols, oxysterol, apolipoprotein E, and 
phospholipids, are arrayed in the wells of a multi-well plate. LEPAP, or biologically active fragments 
thereof, are labeled with 125 I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M. Hunter (1973) 
Biochem. J. 133:529-539.) The selected candidate lipid molecules are incubated with the labeled 

20 LIPAP and washed. Any wells with labeled LIPAP complex are assayed. Data obtained using 

different concentrations of LIPAP are used to calculate values for the number, affinity, and association 
of LIPAP with the candidate molecules. Significant binding of LIPAP to the candidate lipid molecules 
is indicative of LIPAP activity. 
XII. Functional Assays 

25 LIPAP function is assessed by expressing the sequences encoding LIPAP at physiologically 

elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include pCMV SPORT (Life Technologies) and pCR3.1 (Invitrogen, Carlsbad CA), both of which 
contain the cytomegalovirus promoter. 5-10 pig of recombinant vector arc transiently transfected into a 

30 human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 

formulations or electroporation. 1-2 /ug of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 

35 Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
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based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 

5 changes in cell size and granularity as measured by forward light scatter and 90 degree side light 
scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular proteins as measured by reactivity with 
specific antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are discussed 

10 in Ormerod, M.G. (1994) Flow Cytometry , Oxford, New York NY. 

The influence of LIPAP on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding LIPAP and either CD64 or CD64-GFP. CD64 and CD64- 
GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 

15 magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression 
of mRNA encoding LIPAP and other genes of interest can be analyzed by northern analysis or 
microarray techniques. 

XIII. Production of LIPAP Specific Antibodies 

20 LIPAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the LIPAP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenic ity, and a corresponding oligopeptide is 

25 synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra , ch. 1 1 .) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St. 

30 Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 

immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-LIPAP 
activity by, for example, binding the peptide or LIPAP to a substrate, blocking with 1 % BSA, reacting 
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
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XIV. Purification of Naturally Occurring LIPAP Using Specific Antibodies 

Naturally occurring or recombinant LIPAP is substantially purified by immunoaffinity 
chromatography using antibodies specific for LIPAP. An immunoaffinity column is constructed by 
covalently coupling anti-LIPAP antibody to an activated chromatographic resin, such as 
5 CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing LIPAP are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of LIPAP (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
10 antibody/LIPAP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as 
urea or thiocyanate ion), and LIPAP is collected. 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
15 Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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What is claimed is: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, 

SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, and SEQ ID NO: 12, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, and SEQ ID NO: 12, 
10 c) a biologically active fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
NO:ll, and SEQ ID NO: 12, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
15 NO:ll,andSEQIDNO:12. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:2, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1 , and SEQ 
ID NO: 12. 

20 

3. An isolated polynucleotide encoding a polypeptide of claim 1 . 

4. An isolated polynucleotide of claim 3 selected from the group consisting of SEQ ID NO: 1 3, 
SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:21, SEQ ID NO:22, 

25 SEQ ID NO:23, and SEQ ID NO:24. 

5. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

30 6. A cell transformed with a recombinant polynucleotide of claim 5. 

7. A transgenic organism comprising a recombinant polynucleotide of claim 5. 

8. A method for producing a polypeptide of claim 1 , the method comprising: 
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a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide comprises 
a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

5 

9. An isolated antibody which specifically binds to a polypeptide of claim 1 . 

10. An isolated polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of: 

10 a) a polynucleotide sequence selected from the group consisting of SEQ ID NO: 1 3, SEQ ID 

NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:21, SEQ ID NO:22, SEQ ID 
NO:23, and SEQ ID NO:24, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 13, SEQ ID NO: 14, SEQ 

15 ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, and SEQ 
ID NO:24, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 

20 

1 1 . An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 10. 

12. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
25 having a sequence of a polynucleotide of claim 10, the method comprising: 

a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 
is formed between said probe and said target polynucleotide, and 
30 b) detecting the presence or absence of said hybridization complex, and, optionally, if present, 

the amount thereof. 



13. A method of claim 12, wherein the probe comprises at least 30 contiguous nucleotides. 
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14. A method of claim 12, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 1 
and a pharmaceutically acceptable excipient. 

5 

1 6. A method for treating a disease or condition associated with decreased expression of 
functional LIPAP, comprising administering to a patient in need of such treatment the pharmaceutical 
composition of claim 15. 

10 17. A method for screening a compound for effectiveness as an agonist of a polypeptide of 

claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

15 18. A pharmaceutical composition comprising an agonist compound identified by a method of 

claim 1 7 and a pharmaceutically acceptable excipient. 

19. A method for treating a disease or condition associated with decreased expression of 
functional LIPAP, comprising administering to a patient in need of such treatment a pharmaceutical 

20 composition of claim 18. 

20. A method for screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
25 b) detecting antagonist activity in the sample. 

21. A pharmaceutical composition comprising an antagonist compound identified by a method 
of claim 20 and a pharmaceutically acceptable excipient. 

30 22. A method for treating a disease or condition associated with overexpression of functional 

LIPAP, comprising administering to a patient in need of such treatment a pharmaceutical composition 
of claim 21 . 

23. A method for screening a compound for effectiveness in altering expression of a target 
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polynucleotide, wherein said target polynucleotide comprises a sequence of claim 4, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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Leu 


His 
160 


His 


Tyr 


Arg 


Asn 


Leu 

165 


Ser 


Glu 


Phe 


Phe 


Arg 
170 




Lys 


Leu 




Pro 
175 


Gin 


Ala 


Arg 


Pro 


Val 
180 


Cys 


Gly 


Leu 


His 


Ser 
185 


Val 


He 


Ser 


Pro 


Ser 
190 


Asp 


Gly 




He 


195 


Asn 


Phe 


Gly 


Gin 


Val 
200 


Lys 


Asn 


Cys 


Glu 


Val 
205 


Glu 


Gin 


Val 


Lys 


Gly 

210 


Val 


Thr 


Tyr 


Ser 


Leu 
215 


Glu 


Ser 


Phe 


Leu 


Gly 
220 


Pro 


Arg 


Met 


Cys 


Thr 
225 


Glu 


Asp 


Leu 


Pro 


Phe 
230 


Pro 


Pro 


Ala 


Ala 


Ser 
235 


Cys 


Asp 


Ser 


Phe 


Lys 
240 


Asn 


Gin 


Leu 


Val 


Thr 


Arg 


Glu 


Gly Asn 


Glu 


Leu 


Tyr 


His 


Cys 


Val 










245 










250 










255 


He 


Tyr 




Ala 


Pro 
260 


Gly 


Asp 


Tyr 


His 


Cys 
265 


Phe 


His 


Ser 


Pro 


Thr 
270 
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Asp 


Trp 


Thr 


Val 


Ser 


His 


Arg 


Arg 


His 


Phe 


Pro 


Gly 


Ser 


Leu 


Met 


Ser 


Val 


Asn 


Pro 


Gly 


Met 


Ala 


Arg 


Trp 


He 


Lys 


Glu 


Leu 


Phe 


Cys 


His 


Asn 


Glu 


Arg 


Val 


Val 


Leu 


Thr 


Gly 


Asp 


Trp 


Lys 


His 


Gly 


Phe 


Phe 


Ser 


Leu 


Thr 


Ala 


Val 


Gly 


Ala 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Arg 


He 


Tyr 


Phe 


Asp 


Arg 


Asp 


Leu 


His 


Thr 


Asn 


Ser 


Pro 


Arg 


His 


Ser 


Lys 


Gly 


Ser 


Tyr 


Asn 
350 


Asp 


Phe 


Ser 


Phe 


Val 
355 


Thr 


His 


Thr 


Asn 


Arg 

360 


Glu 


Gly 


Val 


Pro 


Met 

365 




Lys 


Gly 


Glu 


His 
370 


Leu 


Gly 


Glu 


Phe 


375 


Leu 


Gly 


Ser 


Thr 


He 
380 


Val 




He 


Phe 


Glu 
385 


Ala 


Pro 


Lys 


Asp 


Phe 
390 


Asn 


Phe 


Gin 


Leu 


Lys 
395 


Thr 


Gly 


Gin 


Lys 


lie 
400 


Arg 


Phe 


Gly 


Glu 


Ala 
405 


Leu 


Gly 


Ser 



























<210> 4 

<211> 759 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2766980CD1 

<400> 4 



Met 


Glu 


Ser 


Ser 


Pro 


Phe 




Arg 


Arg 


Gin 


Trp 


Thr 


Ser 


Leu 


Ser 


1 








5 










10 










15 


Leu 


Arg 


Val 


Thr 


Ala 
20 


Lys 


Glu 


Leu 


Ser 


Leu 
25 


Val 


Asn 






Lys 

30 


Ser 


Ser 


Ala 


He 


Val 

35 


Glu 


He 


Phe 


Ser 


Lys 
40 


Tyr 


Gin 


Lys 


Ala 


Ala 

45 


Glu 


Glu 


Thr 


Asn 


Met 
50 


Glu 






Arg 


Ser 
55 




Thr 


Glu 


Asn 


Leu 

60 


Ser 


Gin 


His 


Phe 


Arg 
65 


Lys 


Gly 


Thr 


Leu 


Thr 
70 


Val 


Leu 


Lys 


Lys 


Lys 
75 


Trp 


Glu 


Asn 


Pro 


Gly 
80 


Leu 


Gly 


Ala 


Glu 


Ser 
85 


His 


Thr 


Asp 


Ser 


Leu 
90 


Arg 


Asn 


Ser 


Ser 


Thr 
95 


Glu 


He 


Arg 


His 


Arg 
100 


Ala 




His 


Pro 


Pro 
105 


Ala 


Glu 


Val 


Thr 


Ser 
110 


His 


Ala 


Ala 


Ser 


Gly 
115 


Ala 


Lys 


Ala 


Asp 


Gin 
120 


Glu 


Glu 


Gin 


He 


His 
125 


Pro 




Ser 


Arg 


Leu 
130 




Ser 


Pro 


Pro 


Glu 
135 


Ala 


Leu 


Val 


Gin 


Gly 
140 


Arg 


Tyr 


Pro 


His 


He 
145 


Lys 


Asp 


Gly 


Glu 


Asp 
150 


Leu 


Lys 


Asp 


His 


Ser 
155 


Thr 


Glu 


Ser 


Lys 


Lys 
160 


Met 


Glu 


Asn 


Cys 


Leu 
165 


Gly 


Glu 


Ser 


Arg 


His 
170 


Glu 


Val 


Glu 


Lys 


Ser 
175 


Glu 


He 


Ser 


Glu 


Asn 
180 


Thr 


Asp 


Ala 


Ser 


Gly 

185 




He 


Glu 


Lys 


Tyr 
190 


Asn 


Val 


Pro 


Leu 


Asn 
195 


Arg 


Leu 


Lys 


Met 


Met 
200 


Phe 


Glu 


Lys 


Gly 


Glu 
205 


Pro 


Thr 


Gin 


Thr 


Lys 
210 


He 


Leu 


Arg 


Ala 


Gin 
215 


Ser 


Arg 


Ser 


Ala 


Ser 
220 


Gly 


Arg 


Lys 


He 


Ser 
225 


Glu 


Asn 


Ser 


Tyr 


Ser 
230 


Leu 


Asp 


Asp 


Leu 


Glu 
235 


He 


Gly 


Pro 


Gly 


Gin 
240 


Leu 


Ser 


Ser 


Ser 


Thr 
245 


Phe 


Asp 


Ser 


Glu 


Lys 
250 


Asn 


Glu 


Ser 


Arg 


Arg 

255 


Asn 


Leu 


Glu 


Leu 


Pro 
260 


Arg 


Leu 


Ser 


Glu 


Thr 

265 


Ser 


He 


Lys 


Asp 


Arg 
270 


Met 


Ala 


Lys 


Tyr 


Gin 


Ala 


Ala 


Val 


Ser 


Lys 


Gin 


Ser 


Ser 


Ser 


Thr 
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275 








280 










285 


Asn Tyr 


Thr 


Asn 


Glu 


Leu 


Lys 


Ala Ser 


Gly Gly 


Glu 


He 


Lys 


He 






290 








295 










300 


His Lys 


Met 


Glu 


Gin 


Lys 


Glu 


Asn Val 


Pro 


Pro 


Gly 


Pro 


Glu 


Val 






305 








310 










315 


Cys He 


Thr 


His 


Gin 


Glu 


Gly 


Glu Lys 


He 


Ser 


Ala 


Asn 


Glu 


Asn 






320 








325 










330 


Ser Leu 


Ala 


Val 


Arg 
335 


Ser 


Thr 


Pro Ala 


Glu 
340 


Asp 


Asp 


Ser 


Arg 


Asp 
345 


Ser Gin 


Val 


Lys 


Ser 


Glu 


Val 


Gin Gin 


Pro 


Val 


His 


Pro 


Lys 


Pro 






350 








355 










360 


Leu Ser 


Pro 


Asp 


Ser 


Arg 


Ala 


Ser Ser 


Leu 


Ser 


Glu 


Ser 


Ser 


Pro 






365 








370 










375 


Pro Lys 


Ala 


Met 


Lys 


Lys 


Phe 


Gin Ala 


Pro 


Ala 


Arg 


Glu 


Thr 


Cys 






380 








385 










390 


Val Glu 


Cys 


Gin 


Lys 


Thr 


Val 


Tyr Pro 


Met 


Glu 


Arg 


Leu 


Leu 


Ala 






395 








400 










405 


Asn Gin 


Gin 


Val 


Phe 
410 


His 


He 


Ser Cys 


Phe 
415 


Arg 


Cys 


Ser 


Tyr 


Cys 
420 


Asn Asn 


Lys 


Leu 


Ser 


Leu 


Gly 


Thr Tyr 


Ala 


Ser 


Leu 


His 


Gly 


Arg 






425 








430 










435 


He Tyr 


Cys 


Lys 


Pro 


His 


Phe 


Asn Gin 


Leu 


Phe 


Lys 


Ser 


Lys 


Gly 


440 








445 










450 


Asn Tyr 


Asp 


Glu 


Gly 


Phe 


Gly 


His Arg 


Pro 


His 


Lys 


Asp 


Leu 


Trp 




455 








460 










465 


Ala Ser 


Lys 


Asn 


Glu 


Asn 


Glu 


Glu He 


Leu 


Glu 


Arg 


Pro 


Ala 


Gin 






470 








475 










480 


Leu Ala 


Asn 


Ala 


Arg 
485 


Glu 


Thr 


Pro His 


Ser 
490 


Pro 


Gly 


Val 


Glu 


Asp 
495 


Ala Pro 


He 


Ala 


Lys 
500 


Val 


Gly 


Val Leu 


Ala 
505 


Ala 


Ser 


Met 


Glu 


Ala 
510 


Lys Ala 


Ser 


Ser 


Gin 


Gin 


Glu 


Lys Glu 


Asp 


Lys 


Pro 


Ala 


Glu 


Thr 






515 








520 










525 


Lys Lys 


Leu 


Arg 


He 


Ala 


Trp 


Pro Pro 


Pro 


Thr 


Glu 


Leu 


Gly 


Ser 






530 








535 










540 


Ser Gly 


Ser 


Ala 


Leu 


Glu 


Glu 


Gly He 


Lys 


Met 


Ser 


Lys 


Pro 


Lys 






545 








550 










555 


Trp Pro 


Pro 


Glu 


Asp 


Glu 


He 


Ser Lys 


Pro 


Glu 


Val 


Pro 


Glu 


Asp 






560 








565 










570 


Val Asp 


Leu 


Asp 


Leu 


Lys 


Lys 


Leu Arg 


Arg 


Ser 


Ser 


Ser 


Leu 


Lys 






575 








580 










585 


Glu Arg 


Ser 


Arg 


Pro 


Phe 


Thr 


Val Ala 


Ala 


Ser 


Phe 


Gin 


Ser 


Thr 




590 








595 










600 


Ser Val 


Lys 


Ser 


Pro 


Lys 


Thr 


Val Ser 


Pro 


Pro 


He 


Arg 


Lys 


Gly 






605 








610 










615 


Trp Ser 


Met 


Ser 


Glu 


Gin 


Ser 


Glu Glu 


Ser 


Val 


Gly 


Gly 


Arg 


Val 






620 








625 










630 


Ala Glu 


Arg 


Lys 


Gin 


Val 


Glu 


Asn Ala 


Lys 


Ala 


Ser 


Lys 


Lys 


Asn 




635 








640 










645 


Gly Asn 


Val 


Gly 


Lys 


Thr 


Thr 


Trp Gin 


Asn 


Lys 


Glu 


Ser 


Lys 


Gly 






650 








655 










660 


Glu Thr 


Gly 


Lys 


Arg 


Ser 


Lys 


Glu Gly 


His 


Ser 


Leu 


Glu 


Met 


Glu 




665 








670 










675 


Asn Glu 


Asn 


Leu 


Val 

680 


Glu 


Asn 


Gly Ala 


Asp 
685 


Ser 


Asp 


Glu 


Asp 


Asp 


Asn Ser 


Phe 


Leu 


Lys 
695 


Gin 


Gin 


Ser Pro 


Gin 
700 


Glu 


Pro 


Lys 


Ser 


Leu 


Asn Trp 


Ser 


Ser 


Phe 


Val 


Asp 


Asn Thr 


Phe 


Ala 


Glu 


Glu 


Phe 


Thr 






710 








715 










720 


Thr Gin 


Asn 


. Gin 


l Lys 
725 


Ser 


Gin 


Asp Val 


Glu 
730 


Leu 


Trp 


Glu 


Gly 


• Glu 
735 


Val Val 


Lys 


Glu 


i Leu 


Ser 


Val 


Glu Glu 


. Gin 


He 


Lys 


Arg 


Asn 


i Arg 






740 








745 










750 


Tyr Tyr 


■ Asp 


, Gli: 


755 


Glu 


. Asp 


i Glu Glu 
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<211> 226 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID 

<400> 5 

Met Glu lie Val 
1 

Phe Gly Asp His 
Asn He Leu Ser 
Val He Lys Asn 
Phe He Lys Ala 
Gly Thr Val Phe 
Gly Lys Trp His 
Ala Cys Val Trp 
Tyr Tyr Ser Phe 
Pro Ser Ser Lys 
Pro Asp Gin Arg 
He Gin Lys Gin 
Val Leu Glu Glu 
Lys Ser Asp Asp 
Leu Arg Lys Asp 
Trp 



<210> 6 

<211> 500 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5324145CD1 

<400> 6 



Met 


Tyr 


Cys 


Pro 


Glu 


Ser 


Ala 


Val 


He 


Leu 


Leu 


Ser 


Thr 


Thr 


Val 


1 








5 










10 










15 


Leu 


Glu 


Asn 


Val 


Leu 
20 


Gin 


Pro 


Phe 


His 


Phe 
25 


Arg 


Ala 


Gly 


Thr 


Met 
30 


Ser 


Lys 


Leu 


Pro 


Lys 
35 


Phe 


Glu 


He 


Glu 


Leu 
40 


Pro 


Ala 


Ala 


Pro 


Lys 

45 


Ser 


Thr 


Lys 


Pro 


Ser 
50 


Leu 


Ser 


Glu 


Arg 


Asp 
55 


He 


Ala 


Met 


Ala 


Thr 

60 


He 


Tyr 


Gly 


Gin 


Leu 
65 


Tyr 


Val 


Leu 


Phe 


Leu 

70 


Arg 


His 


His 


Ser 


Arg 
75 


Thr 


Ser 


Asn 


Ser 


Thr 
80 


Gly 


Ala 


Glu 


Val 


Val 
85 


Leu 


Tyr 


His 


Leu 


Pro 
90 


Arg 


Glu 


Gly 


Ala 


Cys 
95 


Lys 




Met 


His 


He 
100 




Lys 


Leu 


Asn 


Arg 
105 


Thr 


Gly 




Phe 


Ala 
110 


Leu 


Asn 


Val 


Val 


Asp 
115 


Asn 


Leu 


Val 


Val 


Val 
120 



6 



No: 2768356CD1 

Pro He Gly Thr Thr His Val Thr Leu Pro Val 

5 10 15 

Phe Glu Trp Asn Lys Val Thr Ser Cys lie His 

20 25 30 

Gly Gin Arg Trp He Glu His Tyr Gly Glu He 

35 40 45 

Leu His Asp Asp Ser Cys Tyr Cys Lys Val Asn 

50 55 60 

Lys Tyr Trp Ser Thr Asn Ala His Glu He Glu 

65 70 75 

Asp Arg Ser Gly Lys Ala Val His Arg Leu Phe 

80 85 90 

Glu Ser He Tyr Cys Gly Gly Gly Ser Ser Ser 

95 100 105 

Arg Ala Asn Pro Met Pro Lys Gly Tyr Glu Gin 

110 115 120 

Thr Gin Phe Ala Leu Glu Leu Asn Glu Met Asp 

125 130 135 

Ser Leu Leu Pro Pro Thr Asp Thr Arg Phe Arg 

140 145 150 

Phe Leu Glu Glu Gly Asn Leu Glu Glu Ala Glu 

155 160 165 
Arg He Glu Gin Leu Gin Arg Glu Arg Arg Arg 

170 175 180 

Asn His Val Glu His Gin Pro Arg Phe Phe Arg 

185 190 195 
Asp Ser Trp Val Ser Asn Gly Thr Tyr Leu Glu 

200 205 210 
Leu Gly Phe Ser Lys Leu Asp His Pro Val Leu 

215 220 225 
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His 


His 


Gin 


Asp 


Thr 
125 


Glu 


Thr 


Ser 


Val 


He 
130 


Phe 


Asp 


He 


Lys 


Leu 

135 


Arg 


Gly 


Glu 


Phe 


Asp 


Gly 


Ser 


Val 


Thr 


Phe 


His 


His 


Pro 


Val 


Leu 






140 










145 










150 


Pro 


Ala 


Arg 


Ser 


He 


Gin 


Pro 


Tyr 


Gin 


He 


Pro 


He 


Thr 


Gly 


Pro 








155 










160 










165 


Ala 


Ala 


Val 


Thr 


Ser 
170 


Gin 


Ser 


Pro 


Val 


Pro 
175 


Cys 


Lys 


Leu 


Tyr 


Ser 
180 


Ser 


Ser 


Trp 


He 


Val 


Phe 


Gin 


Pro 


Asp 


He 


He 


He 


Ser 


Ala 


Ser 








185 










190 










195 


Gin 


Gly 


Tyr 


Leu 


Trp 


Asn 


Leu 


Gin 


Val 


Lys 


Leu 


Glu 


Pro 


He 


Val 








200 










205 










210 


Asn 


Leu 


Leu 


Pro 


Asp 
215 


Lys 


Gly 


Arg 


Leu 


Met 
220 


Asp 


Phe 


Leu 


Leu 


Gin 
225 


Arg 


Lys 


Glu 


Cys 


Lys 


Met 


Val 


He 


Leu 


Ser 


Val 


Cys 


Ser 


Gin 


Met 








230 










235 










240 


Leu 


Ser 


Glu 


Ser 


Asp 
245 


Arg 


Ala 


Ser 


Leu 


Pro 
250 


Val 


He 


Ala 


Thr 


Val 
255 


Phe 


Asp 


Lys 


Leu 


Asn 


His 


Glu 


Tyr 


Lys 


Lys 


Tyr 


Leu 


Asp 


Ala 


Glu 






260 










265 










270 


Gin 


Ser 


Tyr 


Ala 


Met 


Ala 


Val 


Glu 


Ala 


Gly 


Gin 


Ser 


Arg 


Ser 


Ser 








275 










280 










285 


Pro 


Leu 


Leu 


Lys 


Arg 


Pro 


Val 


Arg 


Thr 


Gin 


Ala 


Val 


Leu 


Asp 


Gin 








290 










295 










300 


Ser 


Asp 


Val 


Tyr 


Thr 


His 


Val 


Leu 


Ser 


Ala 


Phe 


Val 


Glu 


Lys 










305 










310 










315 


Glu 


Met 


Pro 


His 


320 


Phe 


Val 


He 


Ala 


Val 
325 


Leu 


Met 


Glu 


Tyr 


He 
330 


Arg 


Ser 


Leu 


Asn 


Gin 


Phe 


Gin 


He 


Ala 


Val 


Gin 


His 


Tyr 


Leu 


His 








335 










340 










345 


Glu 


Leu 


Val 


He 


Lys 
350 


Thr 


Leu 


Val 


Gin 


His 
355 


Asn 


Leu 


Phe 


Tyr 


Met 
360 


Leu 


His 


Gin 


Phe 


Leu 
365 


Gin 


Tyr 


His 


Val 


Leu 
370 


Ser 


Asp 


Ser 


Lys 


Pro 
375 


Leu 


Ala 


Cys 


Leu 


Leu 
380 


Leu 


Ser 


Leu 


Glu 


Ser 
385 


Phe 


Tyr 


Pro 


Pro 


Ala 
390 


His 


Gin 


Leu 


Ser 


395 


Asp 


Met 


Leu 


Lys 


400 


Leu 


Ser 


Thr 


Ala 


Asn 
405 


Asp 


Glu 


He 


Val 


Glu 


Val 


Leu 


Leu 


Ser 


Lys 


His 


Gin 


Val 


Leu 


Ala 








410 










415 










420 


Ala 


Leu 


Arg 


Phe 


He 
425 


Arg 


Gly 


He 


Gly 


Gly 
430 


His 


Asp 


Asn 


He 


Ser 
435 


Ala 


Arg 


Lys 


Phe 


Leu 


Asp 


Ala 


Ala 


Lys 


Gin 


Thr 


Glu 


Asp 


Asn 


Met 








440 










445 












Leu 


Phe 


Tyr 


Thr 


He 
455 


Phe 


Arg 


Phe 


Phe 


Glu 
460 


Gin 


Arg 


Asn 


Gin 


Arg 
465 


Leu 


Arg 


Gly 


Ser 


Pro 


Asn 


Phe 


Thr 


Pro 


Gly 


Glu 


His 


Cys 


Glu 


Glu 








470 










475 










480 


His 


Val 


Ala 


Phe 


Phe 


Lys 


Gin 


He 


Phe 


Gly Asp 


Gin 


Ala 


Leu 


Met 










485 










490 










495 




Pro 


Thr 


Thr 


Phe 
500 























<210> 7 

<211> 272 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1004646CD1 

<400> 7 

Met Ser Cys His Asn Cys Ser Asp Pro Gin Val Leu Cys Ser Ser 

15 10 15 

Gly Gin Leu Phe Leu Gin Pro Leu Trp Asp His Leu Arg Ser Trp 

20 25 30 
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Glu 


Ala 


Leu 


Leu 


Gin 

35 


Ser 


Pro 


Phe 


Thr 


Thr 


Tyr 


Val 


Gly 
50 


Phe 


Cys 


Leu 


Leu 


Cys 


Ser 


Trp 


Val 
65 


Pro 


Ala 


Leu 


Asp 


Phe 


Ser 


Pro 


Ser 


Ala 


Gin 


Gin 








80 








Thr 


Leu 


Tyr 


Gin 


His 
95 


Val 


Met 


Phe 


His 


Trp Ala 


Arg 


Ser 


Pro 


Ala 


Leu 










110 








Leu 


Leu 


Leu 


Leu 


Leu 
125 


His 


His 


lie 


Asp 


Met 


Glu 


Phe 


Phe 
140 


Val 


Trp 


His 


Trp 


Leu 


Tyr 


Arg 


Thr 

155 


Phe 


His 


Lys 


Ser 


Phe 


Ala 


Leu 


Ala 
170 


Thr 


Gin 


Tyr 


Ser 




Gly 


Phe 


Phe 
185 


Asp 


Met 


Met 


His 


Pro 


Leu 


Thr 


Thr 

200 


Leu 


Thr 


Phe 


Ser 


Val 


Glu 




His 

215 


Ser 


Gly 


Tyr 






Val 


Pro 


Phe 
230 


Gly 


Trp 


Tyr 


Leu 


His 


His 


Ser 


His 
245 


Phe 


Asn 


Cys 


His 


Trp 


Asp 


Lys 


He 
260 


Leu 


Gly 


Thr 


Ala Arg 















<210> 8 

<211> 282 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1802851CD1 

<400> 8 



Met Ser 


Gly 


Gly 


Trp 


Met 


Ala 


Gin 


1 






5 








Ala Leu 


Gly 


Leu 


Ala 


Leu 


Leu 


Leu 








20 








Leu Glu 


Ala 


Ala 


Ala 


Ser 


Pro 


Leu 








35 








Ala Ala 


Gly 


Pro 


Ser 


Ser 


Gly 


Ser 








50 








Cys Arg 


Thr 


Ser 


Gly 


Leu 


Cys 


Val 








65 








Arg Asp 


Leu 


Asp 


Cys 


Ser 


Asp 


Gly 








80 








He Glu 


Pro 


Cys 


Thr 


Gin 


Lys 


Gly 








95 








Leu Pro 


Cys 


Pro 


Cys 


Thr 


Gly 


Val 








110 








Asp Lys 


Lys 


Leu 


Arg 


Asn 


Cys 


Ser 








125 








Glu Leu 


Arg 


Cys 


Thr 


Leu 


Ser 


Asp 








140 








Arg Cys 


Asp Gly 


His 


Pro 


Asp 


Cys 








155 








Gly Cys 


Gly Thr 


Asn 


Glu 


He 





Phe 


Pro 


Val 


He 


Phe 


Ser 


He 




40 










45 


Pro 


Phe 


Val 


Val 


Leu 




He 




55 










60 


Arg 


Arg 


Tyr 


Lys 


He 


His 


Pro 




70 










75 


Leu 


Leu 


Pro 


Cys 


Leu 


Gly Gin 




85 










90 


Val 


Phe 


Pro 


Val 


Thr 


Leu 


Leu 




100 










105 


Leu 


Pro 


His 


Glu 


Ala 


Pro 


Glu 




115 
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Ser Gly Val Ala Ala Val Leu Ala Gly Thr Thr Glu Ala lie Phe 

125 130 135 

Thr Pro Leu Glu Arg Val Gin Thr Leu Leu Gin Asp His Lys His 

140 145 150 

His Asp Lys Phe Thr Asn Thr Tyr Gin Ala Phe Lys Ala Leu Lys 

155 160 165 

Cys His Gly lie Gly Glu Tyr Tyr Arg Gly Leu Val Pro lie Leu 

170 175 180 

Phe Arg Asn Gly Leu Ser Asn Val Leu Phe Phe Gly Leu Arg Gly 

185 190 195 

Pro lie Lys Glu His Leu Pro Thr Ala Thr Thr His Ser Ala His 

200 205 210 

Leu Val Asn Asp Phe lie Cys Gly Gly Leu Leu Gly Ala Met Leu 

215 220 225 

Gly Phe Leu Phe Phe Pro lie Asn Val Val Lys Thr Arg lie Gin 

230 235 240 

Ser Gin lie Gly Gly Glu Phe Gin Ser Phe Pro Lys Val Phe Gin 

245 250 255 

Lys lie Trp Leu Glu Arg Asp Arg Lys Leu lie Asn Leu Phe Arg 

260 265 270 

Gly Ala His Leu Asn Tyr His Arg Ser Leu He Ser Trp Gly He 

275 280 285 
He Asn Ala Thr Tyr Glu Phe Leu Leu Lys Val He 

290 295 



<210> 13 
<211> 2174 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 161190CB1 

<400> 13 

ggatccacac agctcagaac agctggatct tgctcacact ctttcaagag aagcttcctt 60 

ggacaaaagg accctgcctt ggtgtgagag tgagggcaga gggagctgga gcaagtagaa 12 0 

tttctctaaa taccagctgg ctggggccca ggagattaaa aaacaccggg ctagggttaa 180 

gaaaaaaaac gaacccttcc agtcaggtca gtgactggag agctccaagg aaagtctctc 240 

agtgacctgg ctgctggcac catggactca gaaaagaaac gctttactga agaggccacc 300 

aaatacttcc gggagagagt cagcccagtg catctgcaaa tcctgctgac taacaatgaa 360 

gcctggaaga gattcgtgac tgcggctgaa ttgcccaggg atgaggcaga tgctctctac 420 

gaagctctga agaagcttag aacatatgca gctattgagg acgaatatgt gcagcagaaa 480 

gatgagcagt ttagggaatg gtttttgaaa gagtttcccc aagtcaagag gaagatccag 540 

gagtccatag aaaagcttcg tgcccttgca aatggtattg aagaggtcca cagaggctgc 600 

accatctcca atgtggtgtc cagctccact ggcgctgcct ctggcatcat gtcccttgct 660 

ggtcttgttt tggcaccatt tacagcaggg acgagtctgg cccttactgc agctggggta 720 

gggctgggag cagcgtctgc tgtgactggg atcaccacca gcatcgtgga gcactcatac 780 

acatcatcag cagaagctga agccagcagg ctgactgcaa ccagcattga ccgatcgaag 840 

gtatttaagg aagttatgcg tgacatcaca cccaacttac tttcccttct taataattat 900 

tacgaagcca cacaaaccat tgggagtgaa atccgtgcca tcaggcaagc cagagccagg 960 

gcccgactcc ctgtgaccac ctggcgaatc tcagctggaa gtggaggtca agcagagaga 102 0 

acgattgcag gcaccacccg ggcagtgagc agaggagccc ggatcctgag tgcgaccact 1080 

tcaggcatct tccttgcact ggatgtggtc aaccttgtat acgagtcaaa gcacttgcat 1140 

gagggggcaa agtctgcatc tgctgaggag ctgaggcggc aggctcagga gctggaggag 1200 

aatctaatgg agctcactca gatctatcag cgtctgaatc catgccatac ccactgaccc 1260 

cagaccagtg cagccagcag gggaggtgag ccatacacag gccacgacaa aatgcaggca 132 0 

ctttattagg gggataaaga gggcaaggta aagtttatgg agctgagtgt tagtgacttt 1380 

ggcatttctg tagctgagca cagcagggga ggggttaatg cagatggcaa gtgcaccaag 1440 

gagaaggcag gaatgctgga gcctggaaca agggaagaga ggggactgga gagtgtgggg 1500 

aataggaaga agaaatttcc tttagactaa cgaatatatt ggggggagga atagagggga 1560 

ggtgtgcagg aaccagcaat gagaaggcca ggaaaagaaa gagctgaaaa tgcagaaagc 162 0 

cgaagagtta gaacttttgg atacagcaga agaaacagcg gctccactac cgacctgccc 1680 

ccggttcgat gtccttccaa gaatgaagtc tttccctggt gatggtcccc tgccctgtct 1740 

ttccagcatc cactctgtct tgtcctcctg gaagtgtatc tcagtcagcc agtggcttct 1800 

tgatgatggc ggtggaggtg gtggttgtag tgtgatggat cccctttagg ttatttaggg 1860 

gtatatgtcc cctgctcgaa ccctgaaggc caggtaatga gccatggcca ttgtccccag 1920 
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ctgaggacca ggtgtctcta aaaacccaaa catcctggag agtatgcgag aacctaccaa 1980 

gaaaaacagt ctcattactc acatacagca ggcaaagaga cagaaaatta actgaaaagc 2040 

agtttagaga ccgggggagg ccggatctct agagccatcc tgctgagtgc cctgtgtgta 2100 

agtcctaata aactcaccta ctcaccaaaa aaaaaaacga aaaactaaag aacaggagaa 2160 

aaaaagggga gggc 2174 



<210> 14 

<211> 2620 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1292575CB1 

<400> 14 

agtgagcccg gccggcaacc gagggacccg 
tccaagagaa agtttgtaaa attccttaca 
gtagaagagc acatgtcagg ggcagtggag 
gaattctttg atgccgtcac aggctttgat 
gcaaatcaga aagtcacggg aatgattgac 
aaaactgggg agaggccctc tcaagagaac 
gctcccatgt tcagcagaag cgacttcagc 
ctggagctgt ccaagatcac gatgccaatc 
cggatcacgg agtacatgga gcacgtgtac 
cccctggaga ggatgcagtc tgtggctgct 
gagaggaccg gcaaaccatt taatccactc 
gatttaggat tcagatttat atcggaacag 
cactcggaag gtctcaacca tgacttcctg 
ttctggggca aaagcgtgga ggcggagccc 
cataatgaag cctacacctg gaccaacccc 
aagctgtgga tagagcagta tgggacagtg 
tgtgtgcttc actttaaacc gtgtggatta 
cacattcaag acaaaaacaa aaagaagctc 
ttgtggggca tagatcctgt ttcgtatgaa 
cacctgagaa aggccaagct ggatgaagac 
gacgtgcctg tggcccagga gaccgtgcag 
atcaacaccc ggccccccaa ctctgcccag 
ctcaacgagc tggagacagg catggagaag 
cctgacatcc gcggcatgga gaatggcaac 
ctggaggaga agcagagaga agcacggagg 
acgaggtggt tctacccagg caataacccc 
ggggattact ttgagcggaa tttctccgac 
gcctggggcc cgggaccgga ggctgacgag 
ctcgtcacag cagaaaccaa cttttctaac 
gatcaaggat gtaattctaa ttaactgttg 
ctcttcataa agcttcactt gggatcatcg 
tcacggcgcc cttttatgtg gcagaaatca 
tcagtcatag catgtgtagc taaaggaagt 
tgcagtggca aaaggttctg aaagcctttt 
ttgaagctaa tcctgcagag agttttatag 
aaacagaagt gaagagtttc agagcatcag 
ttttaacaat cttcaataat gtaaagatta 
aatagtattt tgctgaagag caagatatgc 
gaagcatccc cgtgacaaac cagagtgggc 
aagactgctt ccgcctcaga ggcgtcccgg 
cctggctgtc accgcacacc gcgtccgtgt 
gcgtgtaccc atagcactct tgtgtttctg 
gtgacaagaa tcacatccgg ttgtgtcctg 
gtggtgtaat taaagttatt taaccaaaaa 



cgtccagatc ttcagtgtct attggatttt 60 
ctgtagatgt ggatcagata cgatgattca 120 
gctggctgct gaaggatgaa cggagaggaa 180 
tctgataact cttctgggga attttcagag 240 
ttagacacca gcaaaaacaa taggattggg 3 00 
ggaattcaga aacacaggac atcgctgccg 360 
gtgtggacca tcctgaagaa gtgtgttggc 420 
gccttcaacg agcctctgag cttcttgcag 480 
ctcatccaca gggcctcctg ccagccccag 540 
tttgctgttt cggctgtggc ttcccagtgg 600 
ttgggagaaa cgtatgaatt aatcagggaa 660 
gtcagtcacc acccccccat cagtgcgttc 720 
ttccacggct ccatctaccc caagctcaag 780 
cgaggcacca tcaccctgga gctgctcaaa 840 
acctgctgcg tccacaacgt catcatcggg 900 
gagattttaa accacagaac tggacataag 960 
tttggaaaag aacttcacaa ggtggaagga 1020 
tttatgatct atggcaaacg gacggaatgc 1080 
tccttcaaga agcaggagag gagaggtgac 1140 
tccgggaagg ctgacagcga cgtggctgac 1200 
gtcattcctg gcagcaagct gctctggagg 1260 
atgtataatt tcaccagttt cactgtgagc 1320 
accctgccac ccacggactg ccgcctgcgc 1380 
atggatctgg ccagccagga gaaggagcgg 1440 
gagcgggcca aggaggaggc agagtggcag 150 0 
tacactggga cccccgactg gttgtatgca 1560 
tgcccagata tctactgagg gcctggaggg 1620 
gctggacttc ctcgagtggc cactgtgagc 1680 
gactgagttc gcggagatag- catcatccct 1740 
attgccaaac atttcactct gctgtgccgt 1800 
tcttcattaa ggtttcaaca gggaaattct 1860 
gctggggctt gtttagcttc cagcacactc 1920 
aatgggaagg ggttcatgtt ctctttataa 1980 
aaactcgaac cagtggggga aagatggatc 2040 
aggccaggga ttgccttcta aattatgaca 2100 
attgagtgaa aagttgtcag attctgtatt 2160 
cttttaaaat atttaagtta aaactacttg 2220 
attaatcacc ggttttatac tgtccaaaat 2280 
agaagcatcg agagcgtgac aggaaatccc 2340 
ctgcgattcg ctgccctgtt gtcagtgagg 2400 
ctccaggggg ttcctttctt ctcacacgtc 2460 
tttttcccag tatgcatgtt taaaatagaa 2520 
tgggagggtc agaggcagaa tctacttaca 2580 
aaaaaaaaaa 2 62 0 



<210> 15 

<211> 2426 

<212> DNA 

<213> Homo sapiens 



14 



WO 00/49043 



PCT/US00/04160 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2454393CB1 

<400> 15 

gaaacactag acatgatacg aattcgagct 
agacaagatg gcgacgtccg tggggcaccg 
gtggcggagc agcctccatc cctgtgagat 
acggaagctg ccttttagag cctttcgcac 
ccgaaccatg ttcctgctgc gtcccctgcc 
agggtaccgg cagtatgaga agtacaggga 
tccacccaaa cttgctggtc actgggaggt 
gctgtcacgg gcctggggtc gcctcaatca 
cgtctacagc ctgtacatct ggacgtttgg 
cctgcatcac taccgcaacc tcagcgagtt 
gcctgtctgc ggcctgcaca gcgtgattag 
gcaggtgaag aactgtgagg tggagcaggt 
cctgggcccg cgtatgtgca cagaggaccc 
cttcaagaac cagctggtca cccgggaagg 
ggcccctggg gactaccact gcttccactc 
ccacttccca ggctccctga cgtcagtgaa 
cttctgccat aacgagcggg tggtcctgac 
gacagctgtg ggggccacca acgtgggctc 
cacaaacagc ccaaggcaca gcaagggctc 
caatagagag ggcgccccca tgcgtaaggg 
caccatcgtg ctcatcttcg aggcccccaa 
gaaaatccgc tttggggaag ccctgggctc 
ctaagggatc ttttccaaac agagtgaggg 
ccaggtaagg gcctgcctca gcgtggttgg 
ggctaccacc tgttccagag gcgcagacaa 
aacctatccc gttccttctg cctacaaata 
atttggatct ttttaaacac tgtatagacg 
gttggtctcg gttgtcgttt tttcttgctg 
gttggcccct tttgtttttt ggccttaacg 
ggcacagact gtggatgcct ctcccccagc 
tccgttccca cattcacctc atcctgctct 
gaacgcatca ccctggactc tgctagtgtc 
atacagaccc ttgcaggacc tgatgagtga 
caggcaagac caatcaatca ctgaacctgc 
ggtagtaggc ccagctgggg aaatgggaga 
gaaatatggg aaagttgctg ctattgattc 
gtggatagtt ggtcagtgcc tggaaaacct 
gagcccccag tggcggggac agggccagat 
tctcctgcag gagagacatc attgaatttt 
tatgaaaagt gggagacttc tgaacagtaa 
aaaaaatcaa acttaaaaaa aaaaaa 



cgtacccttc gaggcacgct gagaaggagc 60 
atgtctggga ttactgcacg gggtcgcgcc 120 
cactgccctg agccaatccc tacagccctt 180 
agatgccaga aaaatccaca ctgcccctgc 240 
cattctgttg gtgacaggcg gcgggtatgc 300 
gcgagagctg gagaagctgg gattggagat 3 60 
ggctttgtac aagtcagtgc caacgcgctt 420 
ggtggagctg ccacactggc tgcgcaggcc 480 
ggtgaacatg aaagaggccg ctgtggagga 540 
cttccggcgc aagctgaagc cgcaggcccg 600 
cccatcggat ggaaggatcc tcaactttgg 660 
aaagggggtc acctactccc tggagtcgct 72 0 
gcccttccca ccagccgcgt cgtgtgactc 780 
gaatgagctc tatcacngtg tcatctacct 840 
ccccaccgac tggactgtgt cccaccggcg 900 
ccctggcatg gctcgctgga tcaaagagct 960 
gggggactgg aaacatggct tcttctcact 1020 
cattcgcatc tactttgacc gggacctgca 1080 
ctacaatgac ttcagcttcg tgacgcacac 1140 
cgagcacctg ggcgagttca acccgggctc 1200 
ggacttcaat ttccagctga aaacaggaca 1260 
gctctagagt ctctttcctg attatggctg 1320 
tcttttcaag agggaggccc atgaggccat 1380 
gagtctgacc aggtaggact tgaatgattc 1440 
gaggtggcga gagcccccat catgcccctc 1500 
aaaagtgcag gctggaatga tctcagtcac 1560 
gaagagcctg cattcctgac cgaaccttca 1620 
ctcctccccc catcacctga gctgttttct 1680 
ctcctgctgc acagggtgag gtacctcctt 1740 
agagccacac agccttcgtg acaactgctt 1800 
ttagaaaaag cagtctttgt gcttgtggct 1860 
ttctgaggac actgatgaca ctgattaatg 1920 
cccttctgga gctggccagg tcctctgcag 1980 
ctcatggcac cagagtgaac agggcaggca 2 04 0 
gttcctgtcc ccctccacat atccctacat 2100 
agggtctgtc ttggaggcag aggacccttg 2160 
gtcccagttt atcaggaacg caggcctggg 2220 
ttcatgttga ccctggggat gctgtgaatt 2280 
ttcaactgta tcagtagcac agtatttttg 2340 
ttcatttaat tgcaaagcat tttgaaataa 2400 
2426 



<210> 16 

<211> 3705 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2766980CB1 

<400> 16 

ggccgcagga gcagtaggtg ttagcagctt 
gggacctgtg acagggctgg tagcagcgca 
agtgtctgta gacaagatgg aatcatctcc 
attgagggta acagccaaag aactttctct 
ggaaatattc tccaagtacc agaaagcagc 
taacaccgaa aatctctccc agcactttag 
gtgggagaac ccagggctgg gagcagagtc 
tgagattagg cacagagcag accatcctcc 
agccaaagct gaccaagaag aacaaatcca 
agccctcgtt cagggtcgat acccccacat 



ggtcgcgaca ggtgcgctag gtagagcgcc 60 
gaggaaaggc ggcttttagc caggtatttc 120 
atttaataga cggcaatgga cctcactatc 180 
tgtcaacaag aacaagtcat cggctattgt 240 
tgaagaaaca aacatggaga agaagagaag 3 00 
aaaggggacc ctgactgtgt taaagaagaa 3 60 
tcacacagac tctctacgga acagcagcac 420 
tgctgaagtg acaagccacg ctgcttccgg 480 
ccccagacct agactcaggt cacctcctga 540 
caaggacggt gaggatctta aagaccactc 600 
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aacagaaagt aaaaaaatgg aaaattgtct aggagaatcc aggcatgaag tagaaaaatc 660 
agaaatcagt gaaaacacag atgcttcggg caaaatagag aaatataatg ttccgctgaa 72 0 
caggcttaag atgatgtttg agaaaggtga accaacccaa actaagattc tccgggccca 780 
aagccgaagt gcaagtggaa ggaagatctc tgaaaacagc tattctctag atgacctgga 840 
aataggccca ggtcagttgt catcttctac atttgactcg gagaaaaatg agagtagacg 900 
aaatctggaa cttccacgcc tctcagaaac ctctataaag gaccgaatgg ccaagtacca 960 
ggcagctgtg tccaaacaaa gcagctcaac caactataca aacgagctga aagccagtgg 102 0 
tggcgaaatc aaaattcata aaatggagca aaaggagaat gtgcccccag gtcctgaggt 1080 
ctgcatcacc catcaggaag gggaaaagat ttctgcaaat gagaatagcc tggcagtccg 1140 
ttccacccct gccgaagatg actcccgtga ctcccaggtt aagagtgagg ttcaacagcc 12 00 
tgtccatccc aagccactaa gtccagatcc cagagcctcc agtctttctg aaagttctcc 12 60 
tcccaaagca atgaagaagt ttcaggcacc tgcaagagag acctgcgtgg aatgtcagaa 1320 
gacagtctat ccaatggagc gtctcttggc caaccagcag gtgtttcaca tcagctgctt 1380 
ccgttgctcc tattgcaaca acaaactcag tctaggaaca tatgcatctt tacatggaag 1440 
aatctattgt aagcctcact tcaatcaact ctttaaatct aagggcaact atgatgaagg 1500 
ctttgggcac agaccacaca aggatctatg ggcaagcaaa aatgaaaacg aagagatttt 1560 
ggagagacca gcccagcttg caaatgcaag ggagacccct cacagcccag gggtagaaga 1620 
tgcccctatt gctaaggtgg gtgtcctggc tgcaagtatg gaagccaagg cctcctctca 1680 
gcaggagaag gaagacaagc cagctgaaac caagaagctg aggatcgcct ggccaccccc 1740 
cactgaactt ggaagttcag gaagtgcctt ggaggaaggg atcaaaatgt caaagcccaa 1800 
atggcctcct gaagacgaaa tcagcaagcc cgaagttcct gaggatgtcg atctagatct 1860 
gaagaagcta agacgatctt cttcactgaa ggaaagaagc cgcccattca ctgtagcagc 1920 
ttcatttcaa agcacctctg tcaagagccc aaaaactgtg tccccaccta tcaggaaagg 1980 
ctggagcatg tcagagcaga gtgaagagtc tgtgggtgga agagttgcag aaaggaaaca 2040 
agtggaaaat gccaaggctt ctaagaagaa tgggaatgtg ggaaaaacaa cctggcaaaa 2100 
caaagaatct aaaggagaga cagggaagag aagtaaggaa ggtcatagtt tggagatgga 2160 
gaatgagaat cttgtagaaa atggtgcaga ctccgatgaa gatgataaca gcttcctcaa 2220 
acaacaatct ccacaagaac ccaagtctct gaattggtcg agttttgtag acaacacctt 2280 
tgctgaagaa ttcactactc agaatcagaa atcccaggat gtggaactct gggagggaga 2340 
agtggtcaaa gagctctctg tggaagaaca gataaagaga aatcggtatt atgatgagga 2400 
tgaggatgaa gagtgacaaa ttgcaatgat gctgggcctt aaattcatgt tagtgttagc 2460 
gagccactgc cctttgtcaa aatgtgatgc acataagcag gtatcccagc atgaaatgta 2 520 
atttacttgg aagtaacttt ggaaaagaat tccttcttaa aatcaaaaac aaaacaaaaa 2 580 
aacacaaaaa acacattcta aatactagag ataactttac ttaaattctt cattttagca 2 640 
gtgatgatat gcataagtgc tgtaaggctt gtaactgggg aaatattcca cctgataata 2700 
gcccagattc tactgtattc ccaaaaggca atattaaggt agatagatga ttagtagtat 2760 
attgttacac actattttgg aattagagaa catacagaag gaatttaggg gcttaaacat 2 820 
tacgactgaa tgcactttag tataaagggc acagtttgta tatttttaaa tgaataccaa 2 880 
tttaattttt tagtatttac ctgttaagag attatttagt ctttaaattt tttaggttaa 2940 
ttttcttgct gtgatatata tgaggaattt actactttat gtcctgctct ctaaactaca 3 000 
tcctgaactc gacgtcctga ggtataatac aacagagcac tttttgaggc aattgaaaaa 3 060 
ccaacctaca ctcttcggtg cttagagaga tctgctgtct cccaaataag cttttgtatc 3120 
tgccagtgaa tttactgtac tccaaaCgat tgctttctct tctggtgata tctgtgcttc 3180 
tcataattac tgaaagctgc aatattttag taataccttc gggatcactg tcccccatct 3240 
tccgtgttag agcaaagtga agagtttaaa ggaggaagaa gaaagaactg tcttacacca 3300 
cttgagctca gacctctaaa ccctgtattt cccttatgat gtcccctttt tgagacacca 3360 
atttttaaat acttactagc tctgaaatat attgatttct atcacagtat tctcagggtg 3420 
aaattaaacc aactataggc ctttttcttg ggatgatttt ctagtcttaa ggtttgggga 3480 
cattataaac ttgagtacat ttgttgtaca cagttgatat tccaaattgt atggatggga 3540 
gggagaggtg tcttaagctg taggcttttc tttgtactgc atttatagag atttagcttt 3 600 
aatatttttt agagatgtaa aacattctgc tttcttagtc ttacctagtc tgaaacattt 3660 
ttattcaata aagattttaa ttaaaatttg aaaaaaaaaa aaaaa 3705 



<210> 17 

<211> 1273 

<212> DNA 

<213> Homo sapiens 

<n> 
<220> 

<221> unsure 
<222> 1237 

<223> a or g or c or t, unknown, or other 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2768356CB1 
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<400> 17 

gccaccatcc gcctatctct 
atgtgagatg gaaaaacaaa 
cccatgtgac tctgccagtt 
tccataacat cttaagcggg 
acctgcatga tgattcctgc 
ctaatgccca tgagattgaa 
tgtttgggaa atggcatgaa 
ggagagcaaa tcctatgccg 
tggaattaaa tgaaatggat 
ttaggccaga ccagaggttt 
agaggattga acaactgcag 
accagcctcg gtttttcagg 
tggaacttag aaaagatctt 
gtaaagaaga aagataacat 
acctgtgttt atatatttaa 
gcaactcttt aagtctatat 
gattgtttat atgaatgtag 
aaaatgaaag attgaatgtt 
caaaggttag ggaagattcc 
tatcaacctc tcccaggtgc 
aagctaccaa gtttaatttg 
gctcatacag aca 



gcgtgtcang ctgagtctag 
ttctggggca aatccatgga 
tttggggatc attttgagtg 
cagaggtgga ttgagcacta 
tactgcaaag tgaattttat 
ggcacagtgt ttgacaggag 
agcatctact gtggcggcgg 
aaaggctacg agcaatacta 
ccatcatcaa agtctttatt 
ctagaggaag ggaacttaga 
agagaaaggc ggcgggtctt 
aaatccgacg atgactcttg 
ggtttttcca aactggacca 
tagtgtattt ctcctgtgct 
aagatactct aggatgatca 
tttcctcagt gcgtttcttt 
aacaccttga catttctctt 
catgtgtggg ttaaaaaaag 
aattaaattt atgccttata 
antaagaaat aagaattccc 
gtagctgaaa tatcttnttg 



aaattttgtt ttctggcaag 60 
aattgttcca attggcacaa 120 
gaacaaagtg acctcttgca 180 
tggagagatt gtcatcaaga 240 
aaaggcaaaa tactggagca 300 
tggaaaagcg gttcatcggc 360 
ctcctcctct gcctgtgtat 420 
tagcttcaca cagtttgcgc 480 
gccacctact gacactcgat 540 
agaagctgaa atacaaaagc 600 
agaagaaaat catgtggagc 660 
ggtgagcaac ggcacctatt 720 
tcctgtctta tggtgaaaaa 780 
tgccttctga agtggcacaa 840 
cttgtgctta gcttagcatt 900 
acaatttcaa atgttaccct 960 
tatatataaa ctatttaata 1020 
aagctttaac actaattttc 1080 
aaattatgtt gtagaaaaaa 1140 
agggttactc acccatgcgt 1200 
cctcagacag ctcttgaatt 1260 
1273 



<210> 18 

<211> 1733 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5324145CB1 



<400> 18 

gccaacattc taggattctg 
ggaatcgaat tttaccaggt 
aatctcaatg tgaattggta 
acggtcctgg agaatgtcct 
cccaaatttg agattgaatt 
agagacatcg caatggctac 
tctcggacct ccaacagcac 
gcctgtaaaa agatgcacat 
gtggacaacc tggtagtcgt 
aagttacggg gagagtttga 
tcgatccagc cctatcagat 
gttccatgta aactctattc 
gcaagccaag gttacctctg 
ccagacaaag gaagactcat 
ctgtctgtct gttcacagat 
actgtttttg ataaactcaa 
gcgatggcgg tggaagcagg 
acccaggcgg tgctggacca 
aagaaggaga tgcctcataa 
aaccagtttc agattgcagt 
cagcacaacc tcttttatat 
aaacctttgg cttgtctgct 
tctctggaca tgctgaagcg 
tccaaacacc aagtgttagc 
atttctgcac gaaaattttt 
acaatattcc gcttttttga 
ccaggggaac actgtgaaga 
ctaatgaggc ctacaacatt 
aagttaattt attgcattaa 



ctggactagt tcaactgaaa 
attaccagag aaacggagtc 
catgtactgc cccgagagcg 
gcagcctttt cactttaggg 
accagctgcg cctaagtcaa 
catatacggg cagctgtatg 
aggagcggag gtggtcctct 
attgaagtca aataggacgg 
gcatcatcag gatacagaga 
cggctccgtt accttccacc 
ccccatcaca ggtcctgctg 
ttcatcttgg attgtctttc 
gaacctccaa gtgaaacttg 
ggactttctc ctccagagaa 
gttaagtgag tcagacagag 
ccatgagtat aaaaagtacc 
gcagagccga agcagcccgc 
gtcagatgtg tacacccatg 
atttgtgata gccgtgctga 
acagcattac ctacatgaac 
gctgcatcag ttcctgcagt 
gttatcccca gagagtttct 
actttcaaca gcaaatgatg 
tgccttaagg tttatccggg 
agatgctgca aagcagactg 
acagcgaaac cagcgtttgc 
acatgttgct tttttcaaac 
ctgaaatcac ttgctgtttt 
taaagctcct caaactataa 



ttgtcttcat aacagatcaa 60 
tgaaactctt gaagagccac 120 
ccgtgatctt gctgtctacc 180 
ctggcactat gtcgaagctg 240 
ctaaacccag cctttccgaa 300 
ttctcttctt gaggcatcat 360 
atcatctacc acgagaaggc 420 
gaaagtttgc cctgaacgtg 480 
catcggtaat attcgatatc 540 
accccgtgct tcctgctcga 600 
ccgtgaccag ccagtctcct 660 
aacctgacat cattatcagc 720 
agcccatagt aaatctctta 780 
aggaatgcaa gatggtcatc 840 
catcgctgcc cgtgatagcc 900 
tggatgccga gcagagttat 960 
tcctcaagag gccggtgcgg 1020 
tcctgtcagc ctttgtggaa 1080 
tggaatacat tcgttctctt 1140 
ttgttatcaa aacccttgtc 1200 
accacgtcct cagcgactcc 1260 
atcctcctgc tcatcagcta 1320 
aaatagtaga agttctcctt 1380 
gcattggtgg ccatgacaac 1440 
aagacaacat gcttttctat 1500 
gagggagccc caatttcaca 1560 
agatttttgg agaccaagct 1620 
tttatataaa aatgtgtaca 1680 
aaaaaaaaaa aaa 1733 



<210> 19 
<211> 1370 
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<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1004646CB1 



<400> 19 

gcacagcctc 

gcagctgttc 

gcccttcttc 

ggtcctggat 

cttctcgcca 

gatgtttgtg 

cgaagctccc 

catggagttc 

ccacaaggtg 

ctgggaactg 

cccgctcacc 

cggctacaac 

ggtgcaccac 

ctgggacaaa 

cggtgggtgc 

cctgagctat 

catagataaa 

atcaatacaa 

ctgcatatga 

tgtagaataa 

ttttgtggat 

tgaccaagaa 

gaaatgttat 



gcaatgagct 
ctgcagcccc 
ccggtcatct 
atcctgtgct 
tccgcgcagc 
ttccccgtga 
gagctgctcc 
ttcgtgtggc 
caccaccaga 
ttttctttgg 
accctgacct 
ttcccttggt 
gacctgcatc 
atactgggaa 
ccctaagact 
atattttttt 

ttttgtgttc 
attagacatt 
ggaattttct 
ggtgttgact 
ttatagtacc 
tcgtgcttta 



gccacaactg 
tctgggacca 
tctccatcac 
cctgggtgcc 
agctgctacc 
cgctgctgca 
tgctgctgca 
acctgctgca 
actcgtcctc 
gcrtcttcga 
tccacgtggt 
ccactcacag 
actctcactt 
cgctgcggac 
cgggactgct 
aaagcaacta 
tttttgtaat 
ttggtattct 
catcactggc 
atgtttcaaa 
tctgactcta 
ttcttatatt 
aaaaggaaaa 



ctccgacccc 
cctgaggagc 
cacatacgcg 
cgccctgcgg 
ttgcctgggg 
ttgggcccgc 
ccacatcctg 
ccacaaggtg 
gttcgcgctg 
catgatgaac 
caacatctgg 
actggtgccc 
taactgcaac 
tgcatctgtc 
gtgcctttca 
acttattgct 
ctgacaaagt 
tagtctagct 
atatttagaa 
gtgttccaaa 
aaagcaatca 
tttttataga 
aaaaccatga 



caggtccttt 
tgggaggccc 
ggcttttgcc 
cgctacaaga 
cagaccctct 
agcccggccc 
ttctgcctgc 
ccctggctgt 
gcaacgcagt 
gccacactgc 
ctttccgtgg 
ttcgggtggt 
ttcgctccgt 
ccagcgcggt 
cacttgaatg 
ttatgtttat 
aatttacata 
cacctcaata 
tatctctaaa 
acctggctaa 
aacatgtttc 
actgtatatc 
atcaaataaa 



gcagctccgg 
tcctacagtc 
tgcccttcgt 
tccaccctga 
accagcatgt 
tcctgcccca 
tactcttcga 
accgcacctt 
atatgagcgt 
tcgggtgcca 
aggaccactc 
acgggggtgt 
actttacaca 
gatgtggctg 
aagagaaaca 
ctatgaaaac 
ctgtttgtgt 
gccttgaatc 
aggacttgtt 
aagaatgtat 
tgctggacag 
tattttgaaa 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1370 



<210> 20 

<211> 1264 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 1802851CB1 



<400> 20 

ctcgcgtgcg gtgcgcaggg ataagagagc ggtctggaca gcgcgtggcc ggcgccgctg 60 

tggggacagc atgagcggcg gttggatggc gcaggctgga gcgtggcgaa caggggctct 120 

gggcctggcg ctgctgctgc tgctcggcct cggactaggc ctggaggccg ccgcgagccc 180 

gctttccacc ccgacccctg cccaggccgc aggccccagc tcaggctcgt gcccacccac 240 

caagttccag tgccgcacca gtggcttatg cgtgcccctc acctggcgct gcgacaggga 300 

cttggactgc agcgatggca gcgatgagga ggagtgcagg attgagccat gtacccagaa 3 60 

agggcaatgc ccaccgcccc ctggcctccc ctgcccctgc accggcgcca gtgactgctc 420 

tgggggaact gacaagaaac tgcgcaactg cagccgcctg gcctgcctag caggcgagct 480 

ccgttgcacg ctgagcgatg actgcattcc actcacgtgg cgctgcgacg gccacccaga 540 

ctgtcccgac tccagcgacg agctcggctg tggaaccaat gagatcctcc cggaagggga 600 

tgccacaacc atggggcccc ctgtgaccct ggagagtgtc acctctctca ggaatgccac 660 

aaccatgggg ccccctgtga ccctggagag tgtcccctct gtcgggaatg ccacatcctc 720 

ctctgccgga gaccagtctg gaagcccaac tgcctatggg gttattgcag ctgctgcggt 780 

gctcagtgca agcctggtca ccgccaccct cctccttttg tcctggctcc gagcccagga 840 

gcgcctccgc ccactggggt tactggtggc catgaaggag tccctgctgc tgtcagaaca 900 

gaagacctcg ctgccctgag gacaagcact tgccaccacc gtcactcagc cctgggcgta 960 

gccggacagg aggagagcag tgatgcggat gggtacccgg gcacaccagc cctcagagac 1020 

ctgagctctt ctggccacgt ggaacctcga acccgagctc ctgcagaagt ggccctggag 1080 

attgagggtc cctggacact ccctatggag atccggggag ctaggatggg gaacctgcca 1140 

cagccagaac tgaggggctg gccccaggca gctcccaggg ggtagaacgg ccctgtgctt 1200 

aagacacccc tgctgccccg tctgagggng gcgattaaag tcgcttcaca tcctcaaaaa 12 60 

aaaa 1264 



<210> 21 
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<211> 2566 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2764333CB1 

<400> 21 

cccacgcgtc cgacgagttc tatgatgcgc tgtcagattc cgagtccgaa aggtccctga 60 

gtagattgga agcagtgaca gcacgctcct ttgaagagga aggagagcat ttgggcagta 120 

gaaaacacag aatgtccgaa gaaaaagact gtggtggcgg agatgctctc tccaatggca 180 

tcaagaaaca cagaacaagt ttgccttctc ctatgttttc cagaaatgac ttcagtatct 240 

ggagcatcct cagaaaatgt attggaatgg aactatccaa gatcacgatg ccagttatat 300 

ttaatgagcc tctgagcttc ctacagcgcc taactgaata catggagcat acttacctca 360 

tccacaaggc cagttcactc tctgatcctg tggaaaggat gcagtgtgta gctgcgtttg 420 

ctgtatctgc tgctgcttct cagtgggaac ggactggaaa acctttcaac ccactgctgg 480 

gagagactta tgaattagtg cgagatgacc ttggatttag actcatctcc gaacaggtca 540 

gccatcaccc accaatcagt gcatttcang ctgaaggatt aaacaatgac ttcatctttc 600 

atggctctat ctatcccaaa ctgaaattct gggggaagag tgtagaagca gaacccaaag 660 

gaaccatcac cttggagctc cttgaacaca atgaggcata tacatggaca aatcccacct 720 

gctgtgtgca taatatcatt gtgggtaaac tgtggatcga acagtatggc aatgtggaaa 780 

ttataaacca caagactggg gacaaatgtg tgttgaattt taagccatgt ggcctttttg 840 

gtaaggaatt acacaaagtt gaaggctaca ttcaagataa aagcaaaaag aagctctgtg 900 

ccctctatgg gaagtggact gaatgtttat acagtgttga ccctgccacg tttgacgctn 960 

acaaaaaaaa tgataagaaa aatacagaag agaagaagaa cagcaaacag acgagcacct 1020 

ctgaggagtt ggatgaaatg ccagtgccgg attctgaaag tgtattcatt atccctggaa 1080 

gcgttcttct atggcgaata gccccacggc ctccaaattc tgcccagatg tataatttta 1140 

ctagttttgc aatggttttg aatgaagtag acaaagacat ggagagtgtg attcccaaga 1200 

cagactgcag gttacggcct gacatcagag ccatggaaaa tggagagata gatcaagcta 12 60 

gtgaagaaaa aaaacgactt gaggaaaaac aaagagcagc ccgcaaaaac aggtccaagt 1320 

cagaagagga ctggaagacg aggtggttcc atcaaggtcc taatccctac aatggagcac 13 80 

aggactggat ttactctggc agctactggg acagaaatta cttcaatttg cctgacattt 1440 

attaaaatgc atacaagtca gggtgtttgg ctaatctaca aataagtctt aaacctatgt 1500 

ttttaaattt ttttcccttg gtttctactt atcttttaaa aaaaaaaatg aaaaaacact 1560 

catgagataa ctgcatttca cccaacaaaa gcagggtata aggcgatatt ggtgatgaaa 1620 

gtcttaggaa aaatgcataa ttttgctata aaatgtactt atttggaata ctattttata 1680 

tagaggtaag agaacactgc tggggaatat gctttttatg gttgctgttg ccatatttac 1740 

tgaaggttta tacctaaatg taactttagc tttatggaac tatatagtaa tcccaaatca 1800 

agttattttg aatattttta tgctgtcatg cttgaatgtt ttagatgtaa cctttgacat 1860 

atttagaact ctcctcctat acaatgttta tcctcagata tagaggttat gtcattttat 1920 

aaagacttca ttgataagat ggcttttatt catactaatc ctcccaatgt taccccttcc 1980 

atcttccaag aagaaaaaaa atgcctgaat attcagaata gatatttctg atttgaaaat 2040 

tctaaagaat taaactggaa aagtatttca tttacttagt gctctgaatt tacttttaca 2100 

gttttctgca gtcagtatca ttaaaatggt taagtttaca tttgaactga aaatatgtat 2160 

aaaatctagc aattcacaaa aatgccctag aaatatagat tttaatcacc attacataat 2220 

gacaaacctt gttaaatgct tccacttcca gtggcaaatg ccactaggga aagtaagttg 2280 

cactcatgta agtatcaaac tatataaaag gaggccttgt gcatttcaag tttgcaaagt 2340 

acctgtgtac ttaaaatatg tgtggagacc tactgtacag tagttttgcc cctttaattg 2400 

gggcacattc atcttaaatc ttatagtatt tatccaccca aaccccagac tgagatactg 2460 

ctcccagggg cctaggtagc tgccagtccg tgattttaat tgctgtcttg aagttaacaa 2520 

gtgttataat gaaataatct acctgatgct aaataaaggc tttaga 2566 



<210> 22 

<211> 1848 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2798021CB1 

<400> 22 

aaccagctga aaagcatgcg ccaagctgca 
aggctagagg aggagataaa atttaattta 
ttagaaaata agaaaaagga attacatttt 
ggccattctg atcttcttga acttgaatct 



gcagatgcaa agcctgaaag tttaatgaag 6 0 
tatatggtaa ctgaaaaatt tcctaaagaa 120 
ttacaaaaag tagtttcaga gccagctatg 180 
aaaataaatg aaataaacac agaaattaac 240 
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cagttgattg aaaagaaaat gatgagaaat gagcccattg aaggcaaact ctcactgtat 300 
aggcaacagg catctatcat ttcccgtaaa aaagaagcca aagctgagga acttcaggag 360 
gccaaggaga agttagccag cctagagaga gaagcatcag taaagagaaa tcagacccgt 420 
gaatttgatg gtactgaagt tttaaaggga gatgagttca aacgatatgt caataaactt 480 
cgaagcaaga gtacagtttt caaaaagaag catcagataa tagctgaact taaagctgaa 540 
ttcggtcttt tgcagaggac tgaagaactt cttaagcaac gtcatgaaaa tattcaacaa 600 
caactgcaaa ctatggagga gaaaaagggt atatctggat atagttacac ccaagaagag 660 
ctagaaagag tatctgcact gaagagtgaa gttgatgaaa tgaaaggacg aacattggat 72 0 
gatatgtctg aaatggtgaa aaaactgtat tcattggtat ctgaaaagaa gtcagctctt 780 
gcctcagtta taaaagagct acgacagttg cgtcaaaaat atcaagaact gacccaggag 840 
tgtgatgaaa agaaatccca gtatgatagc tgtgcagcag gcctcgaaag caatcggtcc 900 
aaattagaac aggaagttag aagactccgt gaagaatgtc ttcaagaaga aagtagacac 960 
cattatacaa attgtatgat taagaaccta gaagttcaac ttcgtcgcgc tactgatgag 1020 
atgaaggcat atatctcttc tgatcaacaa gaaaaaagaa aggcaattag ggaacagtat 1080 
accaaaaata ctgctgaaca agaaaacctt ggaaagaaac ttcgggaaaa acaaaaagtt 1140 
atacgagaaa gtcatggtcc aaatatgaaa caagcaaaaa tgtggcgcga tttggaacaa 1200 
ttaatggaat gtaagaaaca gtgctttctg aaacaacaaa gccaaacttc cattggtcag 1260 
gtaattcagg agggtgggga ggaccggcta atactgngaa ttcttgtgtc atcgtttggg 1320 
gttttacttg ataccactag ctataagcct aatctcataa tgtatttctt ttttgaaact 1380 
gatttgtata gcattttgtt ttcagaagag ccattcttta ttaagttttc atagaaaata 1440 
atgttaaggt agatttagtt tgaatgtttt ttcatatgaa aaagaggctt ttattctttt 1500 
ccatagttta gacatcactg gcgccttctg agctttatga gacaggaaac taagtttact 1560 
atctgtaaat gtaaacatat gtccattaag aaacatgtag ttttttttta gaatgtaata 1620 
acccagtggc ttactgtttt tcttaatctc ttttaaaaaa actttagaag aatcttttag 1680 
gaactaatat ctcttgttct gaagaaacat ttatctgacg ttcagcagtt cctacagttt 1740 
tacttcagtt tatttttctt ctgtaaaatg caagaaaatt taatattttg actaacatgt 1800 
cttttctgtt tgtatcattt aaaggcatat aaacttgtcg agtattaa 1848 



<210> 23 

<211> 2518 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3335404CB1 

<400> 23 

cgcagagccg gcgagggctt gcggtcctgg acggtagggg tctgcgtctg gtgccacctt 60 
ctccatcagg ctgcttgctg ggtccaccaa gcctgacccc tccagccagg aggggatcac 12 0 
ggagtgtgcc ccccgcccgg gctgctgcct gccggaaggg ggctgggaaa ccgggatccc 180 
cgcgcgccac ttgcccggca gcgtcagttt cacccaggac tggccagcgg ttccctcgtg 240 
gctctgcgcg gaggtccgcc tcctcccttc cctcccccat cccgcgcgct gcctccgcct 300 
ccttcttctg cctccaccct caacccccat ccccgcctga cgggagctag ccctcagtcc 360 
gcccgagctg tggttgtggg cgccggacaa gtccaaggcg ccccctccca atatggacag 420 
ccgctacaac agcactgcgg gcatcgggga cttgaaccag ctgagcgctg ccatcccggc 480 
cacgcgggtg gaggtgtccg tgtcctgcag aaatcttctt gacagagaca cattttctaa 540 
atctgatcca atttgtgtct tatatgtaca aggagttgga aataaagaat ggagagagtt 600 
tggaagaact gaagtaattg acaatacttt aaatcctgat tttgtaagaa agtttattct 660 
ggactacttt tttgaagaaa gagagaatct tcgttttgac ttgtatgatg ttgattcaaa 72 0 
gagccccaac ttatccaaac atgactttct gggacaagtg ttttgtacat tgggagagat 780 
cgttggttca cagggaagtc gcctggaaaa accaatagta ggaattccag ggaagaaatg 840 
tggtacaatc atacttacag cagaggaatt aaactgttgc agggatgccg ttttgatgca 900 
attttgtgcg aacaaattgg acaagaagga cttctttgga aaatcagatc ctttccttgt 960 
attttatcga agtaatgaag atggcagttt tacaatttgt cacaagacag aagttgtcaa 1020 
aaacactcta aatccagtat ggcaagcatt caagatctca gtcagagcat tatgtaatgg 1080 
agactatgac agaacaatca aagtagaggt gtatgactgg gaccgagatg gaagtcatga 1140 
tttcattgga gaatttacaa caagctatag ggaactttct agagggcagt cacaattcaa 12 00 
cgtatatgag gtggtgaatc ccaaaaagaa aggaaaaaag aaaaaatata ctaattctgg 12 60 
aacagtaact ttactctctt tcttggtaga aacagaagtt tcattccttg actacattaa 1320 
gggagggacg caaatcaatt tcacagtggc tattgatttt acagcatcaa acgggaaccc 13 80 
tgctcagccc acttccctcc actacatgaa tccttaccaa ctgaatgcct atggtatggc 1440 
actaaaagca gtgggagaaa ttgttcaaga ttatgacagt gataaaatgt ttccagctct 1500 
aggatttggt gcaaaactgc ctccagatgg aaggatatct cacgaatttg ctttgaatgg 1560 
gaatcctcaa aacccctact gtgatggcat tgagggggtc atggaggctt attacaggag 1620 
tctgaaatct gtacaactat atgggcccac caactttgct cctgtaatta atcatgtagc 1680 
aagatatgct tcttctgtaa aggatggctc ccagtatttt gtgcttctga ttgttacaga 1740 
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tggtgttatc tcagatatgg cccagactaa ggagtccata gttaatgcct caaaacttcc 1800 

aatgtcaata attatagtag gtgttggacc agcagaattt gatgcaatgg tcgaatcgga 1860 

tggagatgat gtaagagtct cctctagagg aaaatatgct gaaagagaca ttgtgcagtt 1920 

tgtgccattc agggattata ttgacagaag tggaaaccac atactgagca tggctagatt 1980 

ggctaaagat gtcctagctg agatccctga gcagtttctc tcctatatga gagcccgagg 2040 

aatcaagcca tcacctgcgc ctcccccata caccccacct acacatgtgt tacagactca 2100 

aatatgactg tgctctgaaa tgctaatgtc aactacaaat caaaagtgct gagttaatgc 2160 

tttgtgcctg gtgctctgta atgaaccagg caatgagata gttttctcag tttggtttca 2220 

gcagttaatg tgctttcttg gatccaaatt taaatatctt cctaaaccaa aactgtaaat 2280 

atggttgttg catgagcaac agaaaaaatt gtttaaatgc ttgaagcaaa gtatggatgt 2340 

cttctctaaa tctttctttc tttttttttt tttttttttt ttaacagaaa cagctaattt 2400 

ccaatgtatt gttgggaaaa agcacaaact gtgtttttaa ctcaaatatt gtcttcgact 2460 

gctatgtgta taggaaagca gtctctgtat caatgtttac atgttactac tttttaaa 2518 



<210> 24 

<211> 1120 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3735780CB1 

<400> 24 

gaaatccagt tatcaaaatt gactcaagaa gagagaacct aacagaacaa taacaatgga 60 

agaaattggg aacattatca caaagctatc atcctgccaa actccaggct cagatgtcac 120 

aggttaaaaa aagtccttca tgaaaaagaa agatcttaag cagcatgatg gattcagaag 180 

ctcatgaaaa gaggccacca atactaacat cttcaaaaca agatatatca cctcatatta 240 

caaatgttgg tgagatgaag cattacttgt gtggctgctg tgcagccttc aacaatgtcg 300 

caatcacatt tcccattcag aaggtcctct ttcgacaaca gctgtatggc atcaaaaccc 360 

gggatgcaat acttcagttg agaagggatg gatttcgaaa tttgtatcgt ggaatccttc 420 

ccccattgat gcagaagaca actacgcttg cacttatgtt tggtctgtat gaggatttat 480 

cctgccttct ccacaagcat gtcagtgctc cagagtttgc aaccagtggc gtggcggcag 540 

tgcttgcagg gacaacagaa gcaattttca ctccactgga aagagttcag acattgcttc 600 

aagaccacaa acatcatgac aaatttacca acacttacca ggctttcaag gcactgaaat 660 

gtcatggaat tggagagtat tatcgaggct tggtgcccat tcttttccgg aatggactca 720 

gcaatgtctt gtttttcggc cttcgaggtc ccattaagga gcatctgcct accgcaacga 780 

ctcacagtgc tcatctggtc aatgatttta tctgtggagg tctattgggt gccatgttgg 840 

gattcttgtt ttttccaatt aatgttgtaa aaactcgcat acagtctcag attggtgggg 900 

aatttcagtc tttccccaag gttttccaaa aaatctggct ggaacgggac agaaaaccga 960 

taaatctttt cagaggtgcc catctgaatt accatcggtc cctcatctct tggggcataa 1020 

tcaatgcaac ttatgagttc ttgttaaagg tcatatgaaa aaaccatcag ttaagtgcca 1080 

tttatcaact gaatagacct tctaagaaga aaaaaaaaaa 1120 
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